Document made available under 
Patent Cooperation Treaty (PCT) 



International application number: PCT/EP2004/011577 
International filing date: 14 October 2004 (14.10.2004) 

Document type: Certified copy of priority document 

Document details: Country/Office: DE 

Number: 103 48 407.8 

Filing date: 17 October 2003 (17.10.2003) 



Date of receipt at the International Bureau: 19 January 2005 (19.01.2005) 



Remark: Priority document submitted or transmitted to the International Bureau in 
compliance with Rule 17.1(a) or (b) 




World Intellectual Property Organization (WIPO) - Geneva, Switzerland 
Organisation Mondiale de la Propriete lntellectuelle (OMPI) - Geneve, Suisse 



BUNDESREPUBLIK^DEUTSCHLANP 




Prioritatsbescheinigung iiber die Einreichung 
einer Patentanmeldung 



Aktenzeichen: 
Anmeldetag: 

i 

Anmelder/lnhaber: 



Bezeichnung: 



103 48 407.8 



17. Oktober2003 



Prof . Martin Widschwendter, Mieming/AT 
Erstanmelder: Epigenomics AG, 10435 Berlin/DE 

Prognostic and Diagnostic Markers Markers for cell 
. proliferative disorders of the breast tissues 



IPC: 



E 03 B 11/16 




Die angehefteten Stucke sind eine richtige und genaue Wiedergabe der ur- 
sprunglichen Unterlagen dieser Patentanmeldung. 



Munchen, den 29. Dezember.2004 
Deutsches Patent- und Markenamt 
Der Pradfrtent _ 



Epigenomics AG 
E30120 



Prognostic and Diagnostic Markers for cell proliferative disorders of the breast tissues 

The present invention relates to prognostic and diagnostic markers for cell proliferative disor- 
ders of the breast tissues. The present invention therefore provides methods and nucleic acids 
for the analysis of biological samples for features associated with the development of breast 
cell proliferative disorders. 

Background of the invention 

. This patent application relates to diagnosis and prognosis of breast cancer. Today, involvement 
of axillary lymph nodes and tumour size are the most important prognostic factors in . breast 
cajacer. Although the presence or absence of metastatic involvement in the axillary lymph 
nodes is the most powerful prognostic factor available for patients with primary breast cancer, 
it is only an indirect measure reflecting the tumours' tendency to spread. In approximately 
one-third of women with breast cancer and negative lymph nodes the disease recurs, while 
about one-third of patients with positive lymph nodes are free of recurrence ten years after 
. loco-regional therapy. These data highlight the need for more sensitive and specific prognos- 
tic indicators, ideally reflecting the presence or absence of tumour-specific alterations in the 
bloodstream that may eventually even after years lead to metastasis. It is now widely accepted 
that adjuvant systemic therapy substantially improves disease-free and overall survival in both 
pre- and postmenopausal women up to the age of 70 years with lymph node-negative or 
lymph node-positive breast cancer (Early Breast Cancer Trialists' Collaborative Group Tam- 
oxifen for early breast cancer: an overview of the randomised trials. Early Breast Cancer Tri- 
alists' Collaborative Group. Lancet, 557: 1451-1467,- 1998.2, 3). It is also generally accepted 
that patients with poor prognostic features benefit the most from adjuvant therapy, whereas 
some patients with- good prognostic features may be overtreated (Goldhirsch et al.: Meeting* 
highlights: International Consensus Panel on -the Treatment of Primary Breast Cancer. Sev- 
enth International Conference on Adjuvant Therapy of Primary Breast Cancer. J. Clin. Oncol., 
.79: 3817-3827, 2001.). Moreover many other factors have been investigated for their potential 
to predict disease outcome, but in general they have only limited predictive value. Recently, 
interesting prognostic parameters including gene-expression profiles, cell cycle regulating 



proteins and occult cytokeratin-positive metastatic cells in the bone marrow have been added 
to the list of prognostic factors, but their prognostic relevance needs to be further evaluated. 

Changes in the status of DNA methylation, known as epigenetic alterations, are one of the 
most common molecular alterations in human neoplasia, including breast cancer 
(Widschwendter and Jones: DNA methylation and breast carcinogenesis. Oncogene, 21: 
5462-5482, 2002). Cytosine methylation occurs after DNA synthesis by enzymatic transfer of 
a methyl group from the methyl donor S-adenosylmethionine to the carbon-5 position of cyto- 
sine. Cytosines are methylated in the human genome mostly when located 5' to a guanosine. 
Regions with a high G:C content are. so-called CpG islands. It has been increasingly recog- 
nized over the past four to five years that the CpG islands of a large number of genes, which 
are mostly unmethylated in normal tissue, are. methylated to. varying degrees in human can- 
cers, thus representing tumor-specific alterations: The presence of abnormally high DNA con- 
centrations in "the serum of patients with various malignant diseases was described several 
years ago. The discovery that cell-free DNA can be shed into the bloodstream has generated 
great interest. Numerous studies have demonstrated tumor-specific alterations in DNA recov- 
ered from plasma or serum of patients with various malignancies, a finding that has potential 
for molecular, diagnosis and prognosis. The nucleic acid markers described in plasma and se- 
rum include oncogene mutations, microsatellite alterations, gene rearrangements and epige- 
netic alterations, such as aberrant promoter hypermethylation (Anker et al.: Detection of cir- 
culating tumour DNA in the blood (plasma/serum) of cancer patients. Cancer Metastasis Rev,, 
18: 65-73, 1999). During recent years some studies have reported cell-free DNA in se- 
rum/plasma of breast cancer patients at diagnosis (for example: Silva et al.: Presence of tumor 
DNA in plasma of breast cancer patients: clinicopathological correlations. Cancer Res., 59: 
3251-3256, 1999) and in some cases persistence after primary therapy (for example: Silva et 
al.: Persistence of tumor DNA in plasma of breast cancer patients after mastectomy. Ann. 
Surg. Oncol, 9: 71-76, 2002). Nevertheless an increasing number of studies have reported the 
presence of methylated DNA in serum/plasma of patients with various types of malignancies, 
including breast cancer, and the absence of methylated DNA in normal control patients (for 
example: Wong et ai.: Detection of aberrant pl6 methylation in the plasma and serum- of liver 
cancer patients. Cancer Res., 59: 71-73, 1999). So far, only few studies have addressed the 
prognostic value of these epigenetic alterations in patients' bloodstream (Kawakami et al.: 
Hypermethylated APC DNA in plasma and prognosis of patients with esophageal adenocarci- 
noma. J. Natl. Cancer Inst., 92: 1805-1811, 2000; Lecomte et al.: Detection of free-circulating 




tumor-associated DNA in plasma of colorectal cancer patients and its association with prog- 
nosis. Int. J. Cancer, 100: 542-548, 2002). 

It will be appreciated by those skilled in the art that there exists a continuing need to imprdve 
methods of early detection, classification and treatment of breast cancers. In this application 
prognostic and diagnostic DNA methylation-based markers for breast cancer are disclosed. 

5-methylcytosine positions cannot be identified by sequencing since 5-methylcytosine has the 
same base pairing behavior as cytosine. Moreover, the. epigenetic information carried by 5- 
methylcytosine is completely lost during PCR amplification. Currently the most frequently, 
used method for analysing DNA for 5-methylcytosine is based upon the specific -reaction of 
bisulfite with cytosine .which, upon subsequent alkaline hydrolysis, is converted to uracil 
which corresponds to thymidine in its base pairing behaviour; However, 5-methylcytosine 
remains unmodified under these conditions. Consequently, the original DNA is converted in 
such a manner that methylcytosine, which originally could not be distinguished from cytosine 
by its hybridisation behaviour, can now be detected as the only remaining cytosine using 
"normal" molecular biological techniques, for example, by amplification and hybridisation or 
sequencing. All of these techniques are based on base pairing which can now be fully ex- 
ploited. In terms of sensitivity, the prior art is defined by a method which encloses the DNA 
to be analysed in: an agarose matrix, thus preventing the diffusion and renaturation of the 
DNA (bisulfite only reacts with single-stranded DNA), and which replaces all precipitation 
and purification steps with fast dialysis (Olek A, Oswald J, Walter J. A modified and im- 
proved method for bisulphite based cytosine methylation analysis; Nucleic Acids Res. 1996 
Dec 15;24(24):5064-6). Using this method, it is possible to analyse individual cells, which 
illustrates the potential of the method. However, currently only individual regions of a length 
of up to approximately 3000 base pairs are analysed, a global analysis of cells for thousands 
of possible methylation events is not possible/However, this method cannot reliably analyse 
• very small fragments from small sample quantities either: These are lost through the matrix in 
spite of the diffusion protection. 

An overview of the further known methods of detecting 5-methylcytosine may be gathered 
from the following review article: Fraga and Esteller: DNA Methylation: A Profile of Meth- 
ods and Applications. Bidtechniques 33:632-649, Sept. 2002. 





To date, barring few exceptions (e.g., Zeschnigk M, Lich C, Buiting K, Doerfler W, 
Horsthemke B. A single-tube PCR test for the diagnosis of Angelman and Prader-Willi syn- 
drome based on allelic methylation differences at the SNRPN locus. Eur J Hum Genet. 1997 
Mar-Apr;5(2):94-8) the bisulfite technique is only used in research. Always, however, short, 
specific fragments of a known gene are amplified subsequent to a bisulfite treatment and ei- • 
ther completely sequenced (Olek A, Walter J. The pre-implantation ontogeny of the HI 9 
methylation imprint. Nat Genet. 1997 Nov;17(3):275-6) or individual cytosine positions are 
detected by a primer extension reaction (Gonzalgo ML, Jones PA. Rapid quantitation of 
methylation differences at specific sites using methylation-sensitiye single nucleotide primer 
extension (Ms-SNuPE). Nucleic Acids Res. 1997 Jun 1 5;25(12):2529-3 1 , WO 95/00669) or 
by enzymatic digestion (Xiong Z, Laird PW. COBRA: a sensitive and quantitative DNA 
methylation assay. Nucleic Acids Res. 1997 Jun 15;25(12):2532-4). In addition, detection by 
hybridisation has also been described (Olek et al., WO 99/28498). 

Further publications dealing with the use of the bisulfite technique for methylation detection 
in individual genes are: Grigg G, Clark S.. Sequencing 5-methylcytosine residues in genomic 
DNA. Bioessays. 1994 Jun;16(6):43 1-6, 431 ; Zeschnigk M, Schmitz B, Dittrich B, Buiting K, 
Horsthemke B, Doerfler W. Imprinted segments in the human genome: different DNA meth- 
ylation patterns in the Prader-Willi/Angelman syndrome region as determined by the genomic 
sequencing method. Hum Mol Genet. 1997 Mar;6(3):387-95; Feil R, Charlton J, Bird AP, 
Walter J, Reik W. Methylation analysis on individual chromosomes: improved protocol for 
bisulphite genomic sequencing. Nucleic Acids Res. . 1994 Feb 25;22(4):695-6; Martin V, 
Ribieras S, Song- Wang X, Rio MC, Dante R; Genomic sequencing indicates a correlation 
between DNA hypomethylation in the 5" region of the pS2 gene and its expression in human 
breast cancer cell lines. Gene. 1995 May 19;157(l-2):261-4; WO 97746705, WO 95/15373, 
and WO 97/45560. 

»»* 

An overview of the Prior Art in oligomer array manufacturing can be gathered from a special 
edition of Nature Genetics (Nature Genetics Supplement, Volume 21, January 1999), pub- 
lished in January 1 999, and from the literature cited therein. 

Fluorescently labelled probes are often used for the scanning of immobilised DNA arrays. 
The simple attachment of Cy3 and Cy5 dyes to the 5'-OH of the specific probe are particularly 
suitable for fluorescence labels. The detection of the fluorescence of the hybridised probes 



may be carried out, for example via a confocal microscope. Cy3 and Cy5 dyes, besides many 
others, are commercially available. 

Matrix Assisted Laser Desorption Ionisation Mass Spectrometry (MALDI-TOF) is a very 
efficient development for the analysis of biomolecules (Karas M, Hillenkamp F. Laser de- 
sorption ionisation of proteins with molecular masses exceeding 10,000 daltons. Anal Chem. 
1988 Oct 15;60(20):2299-301). An analyte is embedded in a light-absorbing matrix. The ma- 
trix is evaporated by a short laser pulse thus transporting the analyte molecule into the vapour 
phase in an unfragmented manner. The analyte is ionised by collisions with matrix molecules. 
An applied voltage accelerates the ions into a field-free flight tube. Due to their different 
masses, the ions are accelerated at different rates. Smaller ions reach the detector sooner than 
bigger ones. 

MALDI-TOF spectrometry is excellently suited to the analysis of peptides and proteins. The 
analysis of nucleic acids is somewhat more difficult (Gut I G, Beck S. DNA and Matrix As- 
sisted Laser Desorption Ionisation Mass Spectrometry. Current Innovations and Future 
Trends. 1995, 1; 147-57). The sensitivity to nucleic acids is approximately 100 times worse 
than to peptides and decreases disproportionally with increasing fragment size. For nucleic 
acids having a multiply negatively charged backbone, the ionisation process via the matrix is 
considerably less efficient. In MALDI-TOF spectrometry, the selection of the matrix plays an 
eminently important role. For the desorption of peptides, several very efficient matrixes have 
been found which produce a very fine crystallisation. There are now several responsive ma- 
trixes for DNA, however, the difference-in sensitivity has not been reduced. The difference in 
sensitivity can be reduced by. chemically modifying the DNA in such a manner that it be- 
comes more similar to a peptide. Phosphorothioate nucleic acids in which the usual phos- 
phates of the backbone are substituted with thiophosphates can be converted into a charge- 
neutral DNA using simple alkylaticn chemistry (Gut IG, Beck S. A procedure for selective 
DNA alkylation and detection by mass , spectrometry. Nucleic Acids Res. 1995 Apr 
25;23(8): 1367-73). The coupling of a charge tag to this modified DNA results in an increase 
in sensitivity to the same, level as that found for peptides. A further advantage of charge tag- 
ging is the increased stability of the analysis against impurities, which make the detection of 
unmodified substrates considerably more difficult. 



Genomic. DNA is obtained from DNA of cell, tissue or other test samples using standard 
methods. This standard methodology is found in references such as Fritsch and Maniatis eds., 
Molecular Cloning: A Laboratory Manual, 1 989. 

Description 

The present invention provides methods and nucleic acids for the analysis of biological sam- 
ples for features associated with the development of breast cell proliferative disorders. The 
invention is characterised in that the nucleic acid of at least one member of the group of genes 
according to Table 1 is/are contacted with a reagent or series of reagents capable of distin- 
guishing between methylated and non methylated CpG dinucleotides within the genomic se- 
quence of interest. The present invention makes available a method for ascertaining genetic 
and/or epigenetic parameters of genomic DNA. The method is for use for the determining the 
prognosis of breast cell proliferative disorders. The invention presents improvements over the 
state of the art in that by means of the methods and compounds described herein a person 
skilled in the art may carry out a sensitive and specific detection assay of cellular matter com- 
' prising cancerous breast tissue. This is particularly useful as it allows the analysis of samples 
of body fluids which may contain only a minimal amount of cell proliferative disorder cellular 
matter, and enables the detection of said cells and the identification of the organ from which 
they originated (in this case breast). To date there are no known clinically utilisable means for 
the detection of breast cancer using genetic methylation markers to analyse bodily fluid sam- 
ples, such as blood, lymphatic fluids, nipple aspirate and plasma. The generated information 
is useful in the selection of a treatment of the patient. If a positive prognosis is determined a 
further treatment might be redundant, while in a case of a poor prognosis a stronger treatment 
might be necessary. • 

Furthermore, the method enables the analysis of cytosine methylations and single nucleotide 
polymorphisms. 

The genes that form the basis of the present invention are preferably to be used to form a 
"gene panel", i.e. a collection comprising the particular genetic sequences of the present in- 
vention and/or their respective informative methylation sites. The formation of gene panels 
allows for a quick and specific analysis of specific aspects of breast cancer. The gene panel(s) 
as described and employed in this invention can be used with surprisingly high efficiency for 





the diagnosis, treatment and monitoring of and the analysis of a predisposition to breast cell 
proliferative disorders. 

In addition, the use of multiple CpG sites from a diverse array of genes allowsfor a relatively 
high degree of sensitivity and specificity in comparison to single gene diagnostic and detec- " 
tion tools/Of the genes known to be specifically methylated in breast cancer, the particular 
combination of the genes according to the invention provides for a particularly sensitive and 
specific means for the identification of cell proliferative disorders of breast tissues. 

The object of the invention is most preferably achieved by means of the analysis of the meth- 
ylation patterns of one or a combination of genes taken from the group taken from the group 
ESRl, APC, HSD174B4, HICl and RASSFIA (see, for example, Table 1) and/or their regu- 
latory regions. The invention is characterised in that the nucleic acid of one or a combination 
of genes taken from the group ESRl, APC, HSD174B4, HICl and RASSFIA are contacted 
with a reagent or series of reagents capable of distinguishing between methylated and non 
methylated CpG clmucleotides within the genomic, sequence of interest. 

The object of the invention can also be achieved by the analysis of the CpG methylation of 
one or a plurality of any subset of the group of genes ESRl, APC, HSD174B4, HICl and 
RASSFIA, in particular the following subsets are preferred: 



.. RASSFIA and APC, 
. RASSFIA, and 
- APC 

The present invention makes available a method for ascertaining genetic and/or epigenetic 
parameters of genomic DNA. The method is for use- in the improved diagnosis, treatment and 
monitoring of breast cell proliferative disorders. 

The disclosed invention further provides a method for deterrmning the phenotype of a subject 
with a breast cell proliferative disorder comprising 

a) obtaining a biological sample containing genomic DNA from said subject, 

b) analysing the methylation pattern of one or more target nucleic acids comprising one or a 
combination of the genes taken from the group consisting of ESRl, APC, HSD174B4, HICl 



and RASSF1A and/or their regulatory regions by contacting at least one of said target nucleic 
acids in the biological sample obtained from said subject with at least one reagent, or series of- 
reagents that distinguishes between methylated and non-methylated CpG dinucleotides, and 
c) determining the phenotype of the individual by comparison to two known phenotypes, a 
first phenotype characterised by hypermethylation of the target nucleic acid and poor progno- 
sis as relative to a second phenotype characterised by hypomethylation of the analysed target 
nucleic acid and better prognosis 

The DNA may be obtained from any form of biological sample including but not limited to 
cell lines, histological slides, biopsies, tissue embedded in paraffin, breast tissues, blood, 
plasma, lymphatic fluid, lymphatic tissue, duct cells, ductal lavage fluid, nipple aspiration 
fluid and combinations thereof. Genomic DNA must then be isolated from the sample, using 
any means standard in the art. The isolated DNA is treated with at least one reagent, or series 
of reagents that distinguishes between methylated and non-methylated CpG dinucleotides. 
This may be carried out by any means standard in the art including the use of restriction en- 
donucleases. However, it is preferably carried out with bisulfite (sulfite, disulfite) and subse- 
quent alkaline hydrolysis which results in a conversion of non-methylated cytosine nucleo- 
bases to uracil or to another base which is dissimilar to cytosine in terms of base pairing be- 
haviour. If bisulfite solution is used for the reaction, then an addition takes place at the non- 
methylated cytosine bases. Moreover, a denaturating reagent or solvent as well as a radical 
interceptor must.be present. A subsequent alkaline hydrolysis then gives rise to the conversion 
of non-methylated cytosine nucleobases to uracil. The converted DNA. is then used for the 
detection of methylated cytosines. The methylation status of one or more of the genes ESR1, 
APC, HSD174B4, HIC1 and RASSF1 Aand/or their regulatory regions is then analysed. This 
analysis may be carried out by any means standard in the art including the above described 
techniques. In the final step of the method the methylation pattern of the DNA obtained from 
the subject is compared to that of two known phenotypes. The first phenotype is characterised 
by hypermethylation or methylation of the target nucleic acid and poor prognosis as relative 
to a second phenotype characterised by hypomethylation or no methylation of the analysed 
target nucleic acid and better prognosis. It is particularly preferred that the genes APC 
and/RASSFl A are analysed. By determining which of the two phenotypes the subject belongs 
to it is possible to determine a suitable treatment to her breast cell proliferative disorder. 





The method according to the invention may be used for the analysis of a wide variety of cell 
proliferative disorders of the breast tissues including, but not limited to, ductal carcinoma in 
situ, lobular carcinoma, colloid carcinoma, tubular carcinoma, medullary carcinoma, meta- 
plastic carcinoma, intraductal carcinoma in situ, lobular carcinoma in situ arid papillary carci- 
noma in situ. 

Furthermore, the method enables the analysis of cytosine methylations .and single nucleotide 
polymorphisms within said genes. 

the object of the invention is achieved by means of the analysis of the methylation patterns of 
one or more of the genes ESR1, APC, HSD174B4, HIC1 and RASSF1A and/or their regula- 
tory regions. In a particularly preferred embodiment the sequences of said genes comprise 
SEQ ID NOs: 1 to 5 and sequences complementary thereto. 

The object of the invention may also be achieved by analysing the methylation patterns of one 
or more genes taken from the following subsets- of said aforementioned group of genes. In one 
embodiment the object of the invention is achieved by analysis of the methylation patterns of 
the genes RASSF1A and APC and wherein it is further preferred that the sequence of said 
genes comprise SEQ ID NOs: 5 and 3, respectively. In a further embodiment the object of the 
invention is achieved by analysis of the methylation patterns of the gene RASSF1 A and/or its 
regulatory sequences, and wherein it is further preferred that the sequence of said gene com- 
prises SEQ ID NO: 5. The object of the invention may also be achieved by analysis of the 
methylation pattern, of the gene APC and/or its regulatory sequences, and wherein it is further 
preferred that the sequence of said gene comprises SEQ ID NO: 3. 

In a preferred embodiment said method is achieved by contacting said nucleic acid sequences 
in a biological sample obtained from a subject with at least one reagent or a series of reagents, 
wherein said reagent or series of reagents, distinguishes between methylated and non methyl- 
ated CpG dinucleotides within the target nucleic acid. . • . 

In a preferred embodiment, the method comprises the following steps: 
In the first step of the method the genomic DNA sample must be isolated from sources such 
as cells or cellular components which contain DNA, sources of DNA comprising, for exam- 
ple, cell lines, histological slides, biopsies, tissue embedded in paraffin, breast tissues, blood, 



plasma, lymphatic fluid, lymphatic tissue, duct cells, ductal lavage fluid, nipple aspiration 
fluid and combinations thereof. Extraction may be by means that are standard to one skilled in 
the art, these include the use of detergent lysates, sonification and vortexing with glass beads. 
Once the nucleic acids have been extracted the genomic double stranded DNA is used in the 
analysis. 

In one embodiment the DNA may be cleaved prior to the next step of the method, this may be 
by any means standard in the state of the art, in particular, but not limited to, with restriction 
endonucleases. 

In the second step of the method, the genomic DNA sample is treated in such a manner that 
cytosine bases which are unmethylated at the 5 '-position. are converted to uracil,. thymine, or 
another base which is dissimilar to cytosine in terms of hybridisation behaviour. This will be 
understood as "pretreatment" or "chemical pretreatment" hereinafter. 

The above described treatment of genomic DNA is preferably carried out with bisulfite (sul- 
fite, disulfite) and subsequent alkaline hydrolysis which results in a conversion of non- 
methylated cytosine nucleobases to uracil or to. another base which is dissimilar to cytosine in 
terms of base pairing behaviour. If bisulfite solution is used for the reaction, then an addition 
takes place at the non-methylated cytosine bases. Moreover, a denaturating reagent or solvent 
as well as a radical interceptor must be present. A subsequent alkaline hydrolysis then gives 
rise to the conversion of non-methylated cytosine nucleobases to uracil. The converted DNA 
is then used for the detection of methylated cytosines. 

Fragments of the pretreated DNA are amplified, using sets of primer oligonucleotides, and a 
preferably heat-stable, polymerase. Because of statistical and practical considerations, pref- 

i,- t 

erably more than six different fragments having a length of 100 - 2000 base pairs are ampli- 
fied. The amplification of several DNA segments can be carried out simultaneously in one 
and the same reaction vessel. Usually, the amplification is carried out by means of a polymer- 
ase chain reaction (PCR). 

The design of such primers is obvious to one skilled in the art. These should include at least 
two oligonucleotides whose sequences are each reverse complementary or identical to an at 
least 18 base-pair long segment of the' following base sequences specified in the appendix: 



r 





SEQ ID NO 6 to 26. Said primer oligonucleotides are preferably characterised in that they do 
not contain any CpG dinucleotides. In a particularly preferred embodiment of the method, the 
sequence of said primer oligonucleotides are designed so as to selectively anneal to and am- 
plify, only the breast cell specific DNA of interest, thereby niinimising the amplification of 
background or non relevant DNA. In the context of the present invention, background DNA is 
taken to mean genomic. DNA which does not have a relevant tissue specific methylation pat- 
tern, in this case, the relevant tissue being breast tissues. 

According to the present invention, it is preferred that at least one primer oligonucleotide is 
bound to a solid phase during amplification. The different oligonucleotide and/or PNA- 
oligomer sequences can be arranged on a plane solid phase in the form of a rectangular or 
hexagonal lattice, the solid phase surface preferably being composed of silicon, glass, polysty- 
rene, aluminium, steel, iron, copper, nickel, silver, or gold, it being possible for other materi- 
als such as nitrocellulose or plastics to be used as well. 

The fragments obtained by means of the amplification may carry a directly or indirectly de- 
tectable label. Preferred are labels in the form of fluorescence labels, radionuclides, or detach- 
able molecule fragments having a typical mass which can be detected in a mass spectrometer, 
it being preferred that the fragments that are produced have a single positive or negative net 
charge for better detectability in the mass spectrometer. The detection may be carried out and 
visualised by means of matrix assisted laser desorption/ionisation mass spectrometry 
(MALDI) or using electron spray mass spectrometry (ESI). 

In the next step the nucleic acid amplificates are analysed in order to determine the methyla- 
tion status of the genomic DNA prior to treatment. 

' The post treatment analysis of the nucleic acids may be carried out using alternative methods. 
Several methods' for the methylation status specific analysis of the treated nucleic acids are 
described below, other alternative methods will be obvious to one skilled in the art. 

The analysis may be carried out during the amplification step of the method. In one such em- 
bodiment, the methylation status of preselected CpG positions within the genes ESR1, APC, 
HSD174B4, HIC1 and RASSF1A and/or their regulatory regions may be detected by use of 
methylation specific primer oligonucleotides. The term "MSP" (Methylation-specific PCR) 
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refers to the art-recognized methylation assay described by Herman et al. Proc. Natl. Acad. 
Sci. USA 93:9821-9826, 1996, and also disclosed in US Patents No. 5,786,146 and No. 
6,265,171. The use. of methylation status specific primers for the amplification of bisulphite 
treated DNA allows the differentiation between methylated and unmethylated nucleic acids. 
MSP primers pairs contain at least one primer which hybridises to a bisulphite treated CpG 
dinucleotide. Therefore the sequence of said primers comprises at least one CG, TG or CA 
dinucleotide. MSP primers specific for non methylated DNA contain a T at the 3' position of 
the C position in the CpG. According to the present invention, it is therefore preferred that the 
base sequence of said primers is required to comprise a sequence having a length of at least 10 
nucleotides which hybridises to a pretreated nucleic acid sequence according to SEQ ID 
NOs. : 6 to 26 and sequences complementary thereto wherein the base sequence of said oli- 
gomers comprises at least one CG, TG or CA. dinucleotide. 

In one embodiment of the method the methylation status of the CpG positions may be deter- 
• mined by means of hybridisation analysis. In this embodiment of the method the amplificates 
obtained in the second step of the method are hybridised to an array or a set of oligonucleo- 
tides and/or PNA probes. In this context, the hybridisation takes place in the manner de- 
scribed as follows. The set of probes used during the hybridisation is preferably composed of 
at least 4 oligonucleotides or PNA-oligomers. In the process, the amplificates serve as probes 
which hybridise to oligonucleotides previously bonded to a solid phase. The non-hybridised 
fragments are subsequently removed. Said oligonucleotides contain at least one base sequence 
having a length of 10 nucleotides which is reverse complementary or identical to a segment of 
the base sequences specified in the appendix,.the segment containing at least one CpG or TpG 
dinucleotide. In a further preferred embodiment the cytosine of the CpG dinucleotide, or in 
the case of TpG, the thiamine, is the 5 th to 9 th nucleotide from the 5'-end of the 10-mer. One 
oligonucleotide exists for each CpG or TpG dinucleotide. 

The non-hybridised amplificates are then removed. In the final step of the method, the hy- 
bridised amplificates are detected. In this context, it is preferred that labels attached to the 
amplificates are identifiable at each position of the solid phase at which an oligonucleotide 
sequence is located. 

In a preferred embodiment of the method the methylation status of the CpG positions may be 
determined by means of oligonucleotide probes.that are hybridised to the treated DNA con- 



currently with the PCR amplification primers (wherein said primers may either be methyla- 
tion specific or standard). 

A particularly preferred embodiment of this method is the use of fluorescence-based Real 
Time Quantitative PCR (Heid et al., Genome Res. 6:986-994, 1996) employing a dual- 
labelled fluorescent oligonucleotide probe (TaqMan™ PCR, using an ABI Prism 7700 Se- 
quence Detection System, Perkin Elmer Applied Biosystems, Foster City, California). The 
TaqMan™ PCR reaction employs the use of a nonextendible interrogating oligonucleotide, 
called a TaqMan™ probe, which is designed to hybridise to a GpC-rich sequence located 
between the forward and reverse amplification primers. The TaqMan™ probe further com- 
prises a fluorescent "reporter moiety" and a "quencher moiety" covalently bound to linker 
moieties (e.g., phosphoramidites) attached to the nucleotides of the TaqMan™ oligonucleo- 
tide. For analysis of methylation within nucleic acids subsequent to bisulphite treatment it is 
required that the probe be methylation specific, as described in U.S. 6,33 1,393, also known as 
the Methyl Light assay. Variations on the TaqMan™ detection methodology that are also 
suitable for use with the described invention include the use of dual probe technology 
(Lightcycler™) or fluorescent amplification primers (Sunrise™ technology). Both these tech- 
niques may be adapted in a manner suitable for use with bisulphite treated DNA, and moreo- 
ver for methylation analysis within CpG dinucleotides. 

A further suitable method for the use of probe oligonucleotides for the assessment of meth- 
ylation by analysis of bisulphite treated nucleic acids is the use of blocker oligonucleotides. 
The use of such oligonucleotides has been described by D. Yu, M. Mukai, Q. Liu, C. Stein- 
man in BioTechniques 23(4), 1997, 714-720. Blocking probe oligonucleotides are hybridised 
to the bisulphite treated nucleic acid concurrently with the PCR primers. PCR amplification of 
the nucleic acid is terminated at the 5' position of the blocking probe, thereby amplification of 
a nucleic acid is suppressed wherein the complementary sequence to the blocking probe is 
present. The probes may be designed to. hybridise to the bisulphite treated nucleic acid in a 
methylation status specific manner. For example, for detection of methylated nucleic acids 
within a population of unmethylated nucleic acids suppression of the amplification of nucleic 
acids which are unmethylated at the position in question would be carried out by the use of 
blocking probes comprising a 'CG' at the position in question, as opposed to a 'CA\ 



For PCR methods using blocker oligonucleotides, efficient disruption of polymerase-mediated 
amplification requires that blocker oligonucleotides not be elongated by the polymerase. Pref- 
erably, this is achieved through the use of blockers that are 3'-deoxyoligonucleotides, or oli- 
gonucleotides derivatised at the 3' position with other than a "free" hydroxyl group. For ex- 
ample, S'-O-acetyl oligonucleotides are representative of a preferred class of blocker mole- 
cule. 

Additionally, polymerase-mediated decomposition of the blocker oligonucleotides should be 
precluded. Preferably, such preclusion comprises either use of a polymerase lacking 5 '-3' 
exonuclease activity, or use of modified blocker oligonucleotides having, for example, thioate 
bridges at the 5'-terminii thereof that render the blocker molecule nuclease-resistant. Particu- 
lar applications may not require such 5' modifications of the blocker. For example, if the 
blocker- and primer-bmding sites overlap, thereby precluding binding of the primer (e.g., with 
excess blocker), degradation of the blocker oligonucleotide will be substantially precluded. 
This is because the polymerase will not extend the primer toward, and through (in the 5 '-3' 
direction) the blocker - a process that normally results in degradation of the hybridized 
blocker oligonucleotide. 

A particularly preferred blocker/PCR embodiment, for purposes of the present invention and 
as implemented herein, comprises the use of peptide nucleic acid (PNA) oligomers as block- 
ing oligonucleotides. Such PNA blocker oligomers are ideally suited, because they are neither 
decomposed nor extended by the polymerase. 

Preferably, therefore, the base sequence of said blocking oligonucleotides is required to com- 
prise a sequence having a length of at least 9 nucleotides which hybridises to a pretreated nu- 
cleic acid sequence according to one of SEQ ID NOs: 6 to 26 and sequences complementary 
thereto, wherein the base sequence , of said oligonucleotides comprises at least one CpG, TpG 
or CpA.dinucleotide. 

In a further preferred embodiment of the method the determination of the methylation status 
of the CpG positions is carried out by the use of template directed oligonucleotide extension, 
such as MS SNuPE as described by Gonzalgo and Jones (Nucleic Acids Res. 25:2529-2531). 





In a further embodiment of the method the determination of the methylation status of the CpG 
positions is enabled by sequencing and subsequent sequence analysis of the amplificate gen- 
erated in the second step of the method (Sanger F., et al., 1977 PNAS USA 74: 5463-5467). 

The method according to the invention may be enabled by any combination of the above 
means. In a particularly preferred mode of the invention the use of real time detection probes 
is concurrently combined with MSP and/or blocker oligonucleotides. 

A further embodiment of the invention is a method for the analysis of the methylation status 
of genomic DNA without the need for pretreatment. In the first and second steps of the 
method the genomic DNA sample must be obtained and isolated from tissue or cellular 
sources. Such sources may include cell lines, histological slides, biopsies, tissue embedded in 
paraffin, breast tissues, blood, plasma, lymphatic fluid, lymphatic tissue, duct cells, ductal 
lavage fluid, nipple aspiration fluid and combinations thereof. Extraction may be by means 
that are standard to one skilled in the art, these include the use of detergent lysates, sonifica- 
tion and vortexing with glass beads. Once the nucleic acids have been extracted the genomic 
double stranded DNA is used in the analysis. 

In a preferred embodiment the DNA may be cleaved prior to the treatment, this may be by 
any means standard in the state of the art, in particular with restriction endonucleases. In the . 
third step, the DNA is then digested with one or more methylation sensitive restriction en- 
zymes. The digestion is carried out such that hydrolysis of the DNA at the restriction site is 
informative of the methylation status of a specific CpG dinucleotide. 

In a preferred embodiment the restriction fragments are amplified. In a further preferred em- 
bodiment this is carried out using the polymerase chain reaction. 

In the final step the amplificates are detected. The detection may be by any means standard in 
the art, for example, but not limited to, gel electrophoresis analysis, hybridisation analysis, 
incorporation of detectable tags within the PCR products, DNA array analysis, MALDI or 
ESI analysis. 

The aforementioned method is preferably used for ascertaining genetic and/or epigenetic pa- 
rameters of genomic DNA. 



In order to enable this method, the invention further provides the modified DNA of one or a 
combination of genes taken from the group ESR1, APC, HSD174B4, HIC1 and RASSF1A as 
well as oligonucleotides and/or PNA-oligomers for detecting cytosine methylations within 
said genes. The present invention is based on the discovery that genetic and epigenetic pa- 
rameters and, in particular, the cytosine methylation patterns of said genomic DNAs are par- 
ticularly suitable for improved treatment and monitoring of breast cell proliferative disorders. 

The nucleic acids according to the present invention can be used for the analysis of genetic 
and/or epigenetic parameters of genomic DNA. 

In another aspect of the present invention, the object of the present invention is achieved us- 
ing a nucleic acid containing a sequence of at least 18 bases in length of the pretreated geno- 



mic 



DNA according to one of SEQ ID NOs: 6 to 25 and sequences complementary thereto. 



The modified nucleic acids could heretofore not be connected with the ascertainment of dis- 
ease relevant genetic and epigenetic parameters. 

The object of the present invention is further achieved by an oligonucleotide or oligomer for . 
the analysis of pretreated DNA, for detecting the genomic cytosine methylation state, said 
oligonucleotide containing at least one base sequence having a length of at least 10 nucleo- 
tides which hybridises to a pretreated genomic DNA according to SEQ ID Nos: 6 to 26. The 
oligomer probes according to the present invention constitute, important and effective tools 
Which, for the first time,.make it possible to ascertain specific genetic and epigenetic parame- 
ters during the analysis of biological samples for features associated with, a patient's response 
to endocrine treatment. Said oligonucleotides allow the improved treatment and monitoring of 
breast cell proliferative disorders. The base sequence of the oligomers preferably contains at 
least one CpG or TpG dinucleotide. The probes may also exist in the form of a PNA (peptide 
nucleic acid) which has particularly preferred pairing properties. Particularly preferred are 
oligonucleotides according to the present invention in which the cytosine of the CpG dinu- 
cleotide is within the middle third of said oligonucleotide e.g. the 5 th - 9 th nucleotide from 
the 5'-end of a 13-mer oligonucleotide; or in the case of PNA-oligomers, it is preferred for the 
cytosine of the CpG. dinucleotide to be the 4* - 6 th nucleotide from the 5'-end of the 9-mer. 



The oligomers according to the present invention are normally used in so called "sets" which 
contain at least two oligomers and up to one oligomer for each of the CpG dinucleotides 
within SEQ ID NOs: 6 to 26. 

In the case of the sets of oligonucleotides according to the present invention, it is preferred 
that at least one oligonucleotide is bound to a solid phase. It is further preferred that all the 
oligonucleotides of one set are bound to a solid phase. 

The present invention further relates to a set of at least 2 n (oligonucleotides and/or PNA- 
oligomers) used for detecting the cytosine methylation state of genomic DNA, by analysis of 
said sequence or treated versions of said sequence (of the genes ESR1, APC, HSD174B4, 
HIC1 and RASSF1A, as detailed in the sequence listing and Table l);and sequences comple- 
mentary thereto). These probes enable improved treatment and monitoring of breast cell pro- 
liferative disorders. 

The set of oligomers may also be used for detecting single nucleotide polymorphisms (SNPs) 
by analysis of said sequence or treated versions of said sequence of the genes ESR1, APC, 
HSD174B4, HIC1 and RASSF1A . 

It will be obvious to one skilled in the art that the method according to the invention will be 
improved and supplemented by the incorporation of markers and clinical indicators known in 
the state of the art and currently used as diagnostic or prognostic markers. More preferably 
said markers include node status, age, menopausal status, grade, estrogen and progesterone 
receptors. 

The genes that form the basis of the present invention may be used to form a "gene panel", i.e. 
a collection comprising the particular genetic sequences of the present invention and/or their 
respective informative methylation sites. The formation of gene panels allows for a quick and 
specific analysis of specific aspects of breast cancer treatment. The gene panel(s) a^ described 
and employed in this invention can be used with surprisingly high efficiency for the treatment 
of breast cell proliferative disorders by prediction of the outcome of treatment with a therapy 
comprising one or more drugs which target the estrogen receptor pathway or are involved in 
estrogen metabolism, production, or secretion. The analysis of each gene of the panel contrib- 
utes to the evaluation of patient responsiveness, however, in a less prefered embodiment the 



patient evaluation may be achieved by analysis of only a single gene. The analysis of a single 
member of the 'gene panel' would enable a cheap but less accurate means of evaluating pa- 
tient responsiveness, the analysis of multiple members of the panel would provide a rather 
more expensive means of carrying out the method, but with a higher accuracy (the technically 
preferred solution). 

According to the present invention, it is preferred that an arrangement of different oligonu-' 
cleotides and/or PNA-oligomers (a so-called "array") made available by the present invention 
is present in a manner that it is likewise bound to a solid phase. This array of different oligo- 
nucleotide- and/or PNA-oligomer sequences can be characterised in that it is arranged on the 
solid phase in the form of a rectangular or hexagonal lattice. The solid phase surface is pref- 
erably composed of silicon, glass, polystyrene, aluminium, steel,, iron, copper, nickel, silver, 
or gold. However, nitrocellulose as well as plastics such as nylon which can exist in the form 
of pellets or also as resin matrices are suitable alternatives. 

Therefore, a further subject matter of the present invention is a method for manufacturing an 
• array fixed to a carrier material for the improved treatment and monitoring of breast cell pro- 
liferative disorders. In said method at least one oligomer according to the present invention is . 
coupled to a solid phase. Methods for manufacturing such arrays are known, for example, 
from US Patent 5,744,305 by means of solid-phase chemistry and photolabile protecting 
groups. 

A further subject matter of the present invention Telates to a DNA chip for the improved 
treatment and monitoring of breast cell proliferative disorders. The DNA chip contains at least 
one nucleic acid according to the present invention. DNA chips are known, for example, in 
US Patent 5,837,832. 

Moreover, a subject matter of the present invention is a kit which may be composed, for ex- 
ample, of a bisuifite-containing reagent, a set of primer oligonucleotides containing at least 
two oligonucleotides whose sequences in each case correspond to or are complementary to a 
18 base long segment of the base sequences specified in SEQ ID.NOs: 6 to 26 and/or PNA- 
oligomers as well as instructions for carrying out and evaluating the described method. 



In a further preferred embodiment said kit may further comprise standard reagents for per- 
forming a CpG position specific methylation analysis wherein said analysis comprises one or 
more of the following techniques: MS-SNuPE, MSP, Methyl light, Heavy Methyl, and nu- 
cleic acid sequencing. However, a kit along the lines of the present invention can also contain 
only part of the aforementioned components. - 

Typical reagents (e.g., as might be found in a typical MethyLight®-based kit) for Meth- 
yLight® analysis may include, but are not limited to: PCR primers for specific gene (or meth- 
ylation-altered DNA sequence or CpG island); TaqMan® probes; optimized PCR buffers and 
deoxynucleotides; and Taq polymerase. 

Typical reagents (e.g., as might be found in a typical Ms-SNuPE-based kit) for Ms-SNuPE 
analysis may include, but are not limited to: PCR primers for specific gene (or methylation- 
altered DNA sequence or CpG island); optimized PCR buffers and deoxynucleotides; gel ex- 
traction kit; positive control primers; Ms-SNuPE primers for specific gene; reaction buffer 
(for the Ms-SNuPE reaction); and radioactive nucleotides. Additionally, bisulfite conversion 
reagents may include: DNA denaturation buffer; sulfonation buffer; DNA recovery regents or 
kit (e.g., precipitation, ultrafiltration, affinity column); desulfonation. buffer; and DNA recov- 
ery components. 

Typical reagents (e.g., as might be found in a typical MSP-based kit) for MSP analysis may 
include, but are not limited to: methylated and unmethylated PCR primers for specific gene 
(or methylation-altered DNA sequence or CpG island), optimized PCR buffers and deoxynu- 
cleotides, and specific probes. 

The oligomers according to the present invention or arrays thereof as well as a kit according 
to the present invention are intended to be used for the improved treatment and monitoring of 
breast cell proliferative disorders. According to the present invention, the method is prefera- 
bly used for the analysis of important genetic and/or epigenetic parameters within genomic 
DNA, in particular for use in improved treatment and monitoring of breast cell proliferative 
disorders. 

The methods according to the present invention are used, for improved detection, treatment 
and monitoring of breast cell proliferative disorder. - 



The present invention moreover relates to the diagnosis and/or prognosis of events which are 
disadvantageous or relevant to patients or individuals in which important genetic and/or epi- 
genetic parameters within genomic DNA, said parameters obtained by means of the present 
invention may be compared to another set of genetic and/or epigenetic parameters, the differ- 
ences serving as the basis for the diagnosis and/or prognosis of events which are disadvanta- 
geous or relevant to patients or individuals. 

In the context of the present invention the term "hybridisation" is to be understood as a bond 
of an oligonucleotide to a completely complementary sequence along the lines of the Watson- 
Crick base pairings in the sample DNA, foraiing a duplex structure. 

In the context of the present invention, "genetic parameters" are mutations and polymor- 
phisms of genomic DNA and sequences further required for their regulation. To be designated 
as mutations are, in particular, insertions, deletions, point mutations, inversions and polymor- 
phisms and, particularly preferred, SNPs (single nucleotide polymorphisms). 

In the context of the present invention the term "methylation state" is taken to mean the de- 
gree of methylation present in a nucleic acid of interest, this may be expressed in absolute or 
relative terms i.e. as a percentage or other numerical value or by comparison to another tissue 
and therein described as hypermethylated, hypomethylated or as having significantly similar 
or identical methylation status. 

In the context of the present invention the term "regulatory region" of a gene is taken to mean 
nucleotide sequences which affect the expression of a gene. Said regulatory regions may be 
located within, proximal or distal to said gene. Said regulatory regions include but are not 
limited to constitutive promoters, tissue-specific promoters, developmental-specific promot- 
ers, inducible promoters and the like. Promoter regulatory elements may also include certain 
enhancer sequence elements that control transcriptional or translational efficiency of the gene. 

In the context of the present invention the term "chemotherapy" is taken to mean the use of 
drugs or chemical substances to treat cancer. This definition excludes radiation therapy 
(treatment with high energy rays or particles), hormone therapy. (treatment with hormones or. 
hormone analogues (synthetic substitutes) and surgical treatment. 



In the context of the present invention, "epigenetic parameters" are, in particular, cytosine 
methylations and further modifications of DNA bases of genomic DNA and sequences further 
required for their regulation. Further epigenetic parameters include, for example, the acetyla- 
tion of histones which, cannot be directly analysed using the described method but which, in 
turn, correlates with the DNA methylation. 

In the following, the present invention will be explained in greater detail on the basis of the 
sequences, figures and examples without being limited thereto. 

Figure 1 shows the Kaplan-Meier estimated overall survival curves for the gene APC, for a 
set of 86 breast cancer patients. The dotted line (upper curve) shows unmethylated samples 
whereas the unbroken line (lower curve) shows methylated samples. The x-axis shows the 
number of years, and the Y-axis shows the proportion of the group. 

Figure 2 shows the Kaplan-Meier estimated overall survival curves for the gene RASSF1A, 
for a set of 86 breast cancer patients. The dotted line (upper curve) shows unmethylated sam- 
ples whereas the unbroken line (lower curve) shows methylated samples. The x-axis shows 
the number of years, and the Y-axis shows the proportion of the group. 

Figure 3 shows the combined Kaplan-Meier estimated, overall survival curves for the genes 
APC and/or RASSF1A, for a set of 86 breast cancer patients. The dotted line (upper curve) 
shows unmethylated samples whereas the unbroken line (lower curve) shows methylated, 
samples. The x-axis shows the number of years, and the Y-axis shows the proportion of the 
group. 

SEQ ID NOs: 1 to 5 represent 5' and/or regulatory regions and/or CpG rich regions of the 
genes according to Table 1. These sequences are derived from Genbank and will be taken to 
include all minor variations of the sequence material which are currently unforeseen, for ex- 
ample, but not limited to, minor deletions and SNPs. 

SEQ ID NOs: 6 to 26 exhibit the pretreated sequence of DNA derived from the genomic se- 
quence according to Table. 1. These sequences will be taken to include all minor variations of 



the sequence material which are currently unforeseen, for example, but not limited to, minor 
deletions and SNPs. 



Examples 

Using MethyLight, a high-throughput DNA methylation assay, the inventors analysed 3? 
genes in a gene evaluation set, consisting often sera from metastasised patients, 26 patients 
with primary breast cancer and ten control patients. In order to determine the prognostic value 
of genes identified within the gene evaluation set, the inventors finally analysed pretreatment 
sera of 24 patients having had no adjuvant treatment (training set) to determine their prognos- 
tic value. An independent test set consisting of 62 patients was then used to test the validity of. 
genes and combinations of genes, which in the training set were found to be good, prqgnostic 
markers. 

In the gene evaluation set the inventors identified five genes (ESR1, APC, HSD17B4, HIC1 
and RASSF1A). In the training set, patients with methylated serum DNA for RASSF1A 
and/or APC had the worst prognosis (p < 0.001). This finding was confirmed by analysing 
serum samples from the independent test set (p - 0.007). When analysing all 86 investigated 
patients, multivariate analysis showed methylated RASSF1A and/or APC serum DNA to be 
independently associated with poor outcome, with a relative risk for death of 5.7. DNA meth- 
ylation. of particular genes in pretherapeutic sera of breast cancer patients, especially of 
RASSF1 A/APC, is more powerful than standard prognostic parameters. 

The gene evaluation set consisted of patients with recurrent disease (n=10; sera obtained at 
diagnosis of metastasis in the bone, lung, brain or liver) and pretherapeutic sera of recently 
diagnosed primary breast cancer patients (n=26; age range: 36.1 yrs to 83.9 yrs. (mean: 59.3 
yrs.); two, 18 and six patients had pTl, pT2 and pT3 cancers, respectively; 15, ten and one 
patients had lymph node-negative, - positive and unknown disease, respectively) and normal 
controls (n=10; age range: 20.5 to 7L5 yrs. (mean: 44.6 yrs.); all underwent a core biopsy and 
were confirmed to have benign disease of the breast). 

To assess prognostic significance the inventors used pretherapeutic sera in independent train- 
ing (n=24) and test (n=62) sets consisting of patients who did not receive any adjuvant treat- 
ment after surgery. 



Systemic adjuvant therapy was either not necessary, or the patients were not eligible or refused 
any further treatment. The primary surgical procedure included breast-conserving lumpec- 
tomy or modified radical mastectomy and axillary lymph node dissection. Median age of the 
study population was 60 years (range, 28 to 86 yrs.). After a median follow-up of 3.7 yrs. 
(range: one month to 12.2 yrs.) 17 of the 86 patients (20 %) had died. Distribution of aberrant 
serum DNA methylation of the 86 patients and association with clinical and histopathological 
characteristics are shown in Table 2. 

Patients' blood samples were drawn prior to therapeutic intervention. The blood was centri- 
fuged at 2000 g for 10 min at room temperature, and 1 mL aliquots of serum samples were 
stored at -3 0°C., Genomic DNA from serum samples was.isolated using the High.Pure Viral 
Nucleic Acid Kit (Roche Diagnostics, Mannheim, Germany) according to the manufacturer's 
protocol with some modifications for multiple loading of the DNA extraction columns to gain 
a sufficient amount of DNA. Thus, 4 x 200ul of a serum sample were each mixed with 200ul 
of working solution (binding buffer supplemented with polyA carrier RNA) and 50ul protei- 
nase K [18 mg/ml] and incubated for 10 minutes at 72°C, After adding lOOul isopropanol the 
solution was mixed, loaded onto the extraction column and centrifuged for 1 minute at 8000* 
The flow-through was pipetted back into the same column reservoir and centrifuged a second 
time. This procedure was repeated four times for each serum sample. After these "pooling 
steps" the DNA isolation was processed as described in the manufacturer's protocol. For 
DNA elution 55ul of AE-buffer (Quiagen, CA, USA), were added, incubated for 20 min at 
45°C and centrifuged for three minutes at.12.00pg. For both, normal sera and cancer sera 
analysis the same amount of serum for DNA extraction was used. 

Sodium bisulfite conversion of genomic DNA was performed as described previously (Eads 
et al, MethyLight: a high-throughput assay to measure DNA methylation. Nucleic Acids 
Res., 28: E32, 2000). ' 

Sodium bisuifite-treated genomic DNA was analysed by means of the MethyLight, a fluores- 
cence-based, real-time PCR assay, as described previously (Eads et al 2000, see above, Eads 
et al.: Epigenetic patterns in the progression of esophageal adenocarcinoma. Cancer Res., 61: 
3410-3418, 2001). Briefly, two sets of primers and probes, designed specifically for bisulfite- 
converted DNA, were used: a methylated set for the gene of interest and a reference set, B-. 
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actin (ACTS), to normalise for input DNA. Serum samples of patients with recurrent disease 
revealed the highest amount of B-actin, whereas no difference between fi-actin values from 
serum samples of patients with primary breast cancer and sera, of normal controls was ob- 
served. Specificity of the reactions for methylated DNA was confirmed separately using Sssl 
(New England Biplabs)-treated human white blood cell DNA (heavily methylated). The per- ' 
centage of fully methylated molecules at a specific locus was calculated by dividing the 
GENE-ACTB ratio of a sample by the GENE:ACTB ratio of Swl-treated white blood cell DNA 
and multiplying by 100. The abbreviation PMR (percentage of fully methylated reference) 
indicates this measurement. For each MethyLight reaction lOul of bisulfite-treated genomic 
. DNA was used. 

A gene was deemed methylated if the PMR value was > .0.. Primer and probes specific for 
methylated DNA and used for MethyLight reactions are listed in Supplemental Data. 

The inventors used Pearsons Chi 2 or - in the case of low frequencies per cell - Fisher's exact 
method to test associations between categorically clinicopathological features. The Mann- 
Whitney-U-Test was used to assess differences between non-parametric distributed variables. 
Overall survival was calculated from the date of diagnosis of the primary tumour to the date 
of death or last follow-up. Overall survival curves were calculated with the Kaplan-Meier 
method. Univariate analysis of overall survival according to clinicopathological factors (his- 
tological type, tumour stage, nodal status,, grading, menopausal status, hormone receptor 
status (estrogen and/or progesterone receptor positivity), estrogen and progesterone receptor 
status) and gene methylation were performed using a two-sided log-rank test. 

Multivariate Cox proportional hazards analysis was used to estimate the prognostic effect of 
methylated genes. ... 

A p value < 0.05 was considered a statistically significant difference. All statistical analyses 
were performed using SPSS Software 10.0. . - 

The inventors initially investigated 39 genes in the sera of ten patients with metastasised 
breast cancer for the presence of aberrant methylation. The 33 genes positive in the sera of the 
metastasised patients were further evaluated in an independent sample set of pretherapeutic 
sera of 26 patients with primary breast cancer and ten healthy controls. An overview of the 



frequency of methylation in the investigated serum samples is given in Table 3. The most 
appropriate genes for our further analyses were determined to' be those that met one of the 
following criteria: (i) unmethylated in serum samples from healthy controls and > 10% meth- 
ylated in serum samples from primary breast cancer patients, or (ii) < 10% methylated in se- 
rum samples from healthy controls and >20% methylated in serum samples from primary 
breast cancer patients. A total of five genes, namely ESR1, APC, HSD17B4, HIC1 and 
RASSF1 A, met at least one of these criteria (Table 3). 

Pre-treatment serum samples from patients included in the training set were used to evaluate 
the prognostic value of the methylation status of these five genes. In this training set the in- 
ventors 'identified ESR1, APC or RASSF1A methylation in primary breast cancer patients' 
sera to be markers of poor prognosis, whereas HSD17B4 reached only borderline significance 
and aberrant methylation of HIC1 showed no significant results (Table 4). Furthermore, vari- 
ous combinations of the investigated genes were analyzed. Patients were classified as meth- 
ylation-positive if at least one of the genes included in the combination showed aberrant 
methylation. Patients with methylated serum DNA for RASSF1A and/or APC had the worst 
prognosis (P < 0.001), even worse than when each gene was analysed individually (Table 4). 

The highly significant prognostic value for APC and/or RASSF1A methylation in serum sam- 
ples from breast cancer patients was confirmed by analysing the test set (P = 0.007, log rank 
test). ESR1 and APC methylation as single genes or the combinations ESR1/RASSF1A and 
ESR1/APC no longer had prognostic significance (Table 4). . 

Combined analysis of the training and test sets (n=86) showed correlation between ESR1 and 
RASSF1A (P = 0.005) and between ESR1 and APC (P = 0.031), whereas no correlation was 
observed between RASSF1A and APC. In patients^ with advanced tumours RASSF1A and 
ESR1 methylation and in patients with progesterone receptor-negative tumours APC meth- 
ylation was more prevalent in pretherapeutic sera, while no further associations were seen 
between clinicopathological features and DNA methylation of APC, ESR1 or RASSF1 A. (Ta- 
ble 5). RASSF1A methylation in pretherapeutic sera was more prevalent in older than in 
younger patients, whereas age had no effect on DNA methylation of ESR1 or APC. 

Univariate analysis of all 86 investigated patients (training set plus test set) revealed prognos- 
tic significance for tumour size, lymph node metastases and methylation status of APC, 



RASSF1A and the combination of RASSF1A/APC (Table 6; Fig. 1). Due to the fact that 
ESR1 methylation correlates with APC as well as with RASSF1 A methylation, the inventors 
did not test the triple combination in the univariate or the multivariate analyses of all 86 pa- 
tients. 

The Cox multiple-regression analysis included tumour size, lymph node metastases, age and 
methylation status of the investigated genes. Beside lymph node status, methylated RASSF1 A 
and/or APC serum DNA was strongly associated with poor outcome, with a relative risk for 
death of 5. 7 (Table 7). 

Prognosis in patients with newly diagnosed breast cancer is determined primarily by the pres- 
ence or absence of metastases in draining axillary lymph nodes. . Nevertheless, the life- 
threatening event in breast cancer is not lymph node metastasis per se, but haematogenous 
metastases which mainly affect bone, liver, lung and brain. The inventors therefore aimed to 
develop a prognostic test that is sensitive for haematogenous metastases and could be per- 
formed in patients' pretherapeutic serum. 

In recent years several studies have reported cell-free tumour-specific DNA in serum/plasma 
of breast cancer patients at diagnosis. Aberrant methylation of serum/plasma DNA of patients 
with various types of malignancies, including breast cancer, has been described (see above). 

In light of these observations, the inventors examined the methylation status of 39 genes, 
which, on the one hand, are known to be frequently methylated in breast cancer and other 
malignancies. (Jones and Baylin: The fundamental role of epigenetic events in cancer. Nat. 
Rev. Genet., 3: 415-428, 2002; Widschwendter and Jones: DNA methylation and breast car- 
cinogenesis. Oncogene, 21: 5462-5482, 2002) and, on the other hand, were reported to be 
abnormally regulated in tumours of patients with poor prognostic breast cancer (van't Veer et 
al.: Gene expression profiling predicts clinical outcome of breast cancer. Nature, 415: 530- 
536, 2002; van de Vijver et al.: A gene-expression signature as a predictor of survival in 
breast cancer. N. Engl. J. Med., 347: 1999-2009, 2002) Because levels of circulating DNA in 
metastasised patients are known to be higher (Leon et al.: Free DNA in the serum of cancer 
patients and the effect of therapy Cancer Res., 37: 646-650, 1977) and because the loss of 
genetic heterogeneity of disseminated tumour cells with the emergence of clinically evident 
metastasis was recently reported (Klein et ah: Genetic heterogeneity of single disseminated 



tumour cells in minimal residual cancer. Lancet, 360: 683-689, 2002), the inventors firstly 
investigated, these 39 genes in ten sera of metastasised patients to determine the overall 
prevalence of methylation changes in breast cancer. As a next step the inventors analysed the 
33 genes that were positive in the metastasised patients, in the pre-treatment sera of 26 pa- 
tients with primary breast cancer and in ten benign controls in order to. identify the most im- 
portant genes for further analysis. Eventually the inventors came up with five genes (ESRl, 
APC, HSD17B4, HICl and RASSFIA), which were primarily analysed in a group of 24 pa- 
tients (training set). To confirm the significance of this result the inventors tested these genes 
in an independent set of 62 patients (test set). In order to apply the strictest criteria for testing 
the potential of a prognostic factor, the inventors investigated these markers in women, who 
had not undergone adjuvant systemic treatment. DNA methylation of APC and RASSFIA in 
pretherapeutic sera, both , frequently methylated, and abnormally regulated in human primary 
breast cancers (Dammann et al.: Hypermethylation of the cpG island of Ras association do- • 
main family 1A (RASSFIA), a putative tumor suppressor gene from the 3p21.3 locus, occurs 
in a large percentage of human breast cancers. Cancer Res., 61: 3105-3109, 2001; Virmani et 
al.: Aberrant methylation of the adenomatous polyposis coli (APC) gene promoter 1A in 
breast and lung carcinomas. Clin. Cancer Res., 7: 1998-2004, 2001), turned out to be a strong 
independent prognostic parameter. These genes are involved in pathways counteracting me- 
tastasis: mediation of intercellular adhesion, stabilisation of the cytoskeleton, regulation of the 
cell cycle and apoptosis (Fearnhead et al.: The ABC of APC. Hum. Mol. Genet, 10: 721-733, 
2001; Dammann et al.: Epigenetic inactivation of the Ras-association domain family 1 
(RASSFIA) gene and its function in human carcinogenesis. Histol. Histopathol., 18:. 665-677, 
2003). Methylated DNA in patients' pretherapeutic serum coding for these two genes reflects 
poor prognosis. The source of the rumour-specific DNA and its definite role in metastasis 
remains elusive. Circulating tumour-specific altered genetic information may serve as a sur- 
rogate marker for circulating tumour cells that ultimately cause distant metastases. An alter- 
native, but equally attractive, hypothesis is that circulating altered DNA per se may cause de 
novo development of tumour cells in organs known to harbour breast cancer metastases. This 
so-called "Hypothesis of Genometastasis" suggests that malignant transformation might de- 
velop as a result of transfection of susceptible cells in distant target organs with dominant 
oncogenes that circulate in the plasma and are derived from the primary tumour. Interestingly, 
irrespective of the source of DNA in the serum, it is noteworthy that some genes provide 
prognostic information when methylated in patients' sera, whereas genes like HICl, which is 



- methylated in about 40% and 90% of primary, and metastasised breast cancer patients, respec- 
: tively, but in only 10% of healthy individuals, are not at all a prognostic parameter. 

Irrespective of the mechanistic role of methylated DNA with regards to metastasis in breast 
cancer patients, these epigenetic changes in serum have several advantages as indicators of 
poor prognosis as compared to currently used or studied prognostic parameters: DNA in se- 
rum is stable and can be analysed by a high-throughput method like MethyLight. Compared 
to bone marrow aspiration, a simple blood draw (which can be repeated any time throughout 
the follow-up period) is sufficient. The more screening mammographies are performed, the 
more small cancers are treated and after histopathological examination no tumour material 
will remain to perform RNA- and/or protein-based assays for risk evaluation. This application 
therefore demonstrates a useful and easy approach for risk assessment of breast cancer pa- 
. tients. 



Table! 



Gene 


Genomic se- 
quence 


Bisulphite sequence 


HIC1 ' 
NM 006497 


1 


6, 7, 16 and 17 


HSD17B4 
NM 000414 


2 


8, 9, 18 and 19 


APC 

NM 000038 


3 


10, 11, 20 and 21 


ESR1 

NM 000125 


4 


12,13, 22 and 23 


RASSF1A 
NM 170715 


5 


14, 15, 24 and 25 



Table 2. Characteristics of training and test sets. 



Characteristics 


Training Set 


Test Set 


P 




(N=24) 


(N=62) 


Value* 




percent . 




Size of tumour . 






0.024 


Tl 


62.5 


79 




T2 


.37.5 


13 




T3+ T4 


0 


7 ' 




Histologic type 






n.s 


Invasive ductal 


67- 


63 




Invasive lobular 


. 8 


13 




Others 


• 25 


24- 





Tumor grade 






n.s 


1 


A£L 

4o 


44 




2 


3j 


OQ 

jy 




3 


17 


10 




Lymph node metastases 






n.s. 


No 


70 






Yes • 


1 0 < 


1 1 
1 1 




Unknown 




Z4 




Menopausal status 






n c 
U.S. 


Premenopausal 


TO 

J J 


10 




Postmenopausal 


67 


OA 

84 




bstrogen-receptor status 






n.s. 


Positive 


j4 






Negative 


42 


45 




Progensteron-receptor 






n.s. 


status 








Positive 


CO 


4j 




Negative 


38 


4U 




Hormone-receptor status 






U.o. 


Positive 


'63 


50 




Negative 


33 


36 





*P values for the comparison of numbers of patients were calculated by means of the Chi 
test, n.s., not significant; Median age: training set (54.2 years; 37.6-83.2), test set (65.7 years; 
28.2-86.2), P = 0.052 ; Follow-up: training set (8.0 years; 1 month to 12.2. years), test set (3.1 
years; 1 month to 1 1. years) P < 0.001. 
Tumour grade was unknown in six cases. 
Hormone-receptor status was unknown in ten cases. 
Tumour size was unknown in one case. 

Table 3, Frequency of methylated serum DNA in the gene evaluation set. 







Primary Breast Can- 


Recurrent Breast Can- 




Healthy Controls 


cer 


cer 


Gene 


(N = 10) 


(N = 26) 


(N = 10) 






percent positive 




ESR1 


0 


27 


' 70 


APC 


0 


23 


80 


HSD17B4 


0. 


12 


30 • 


CDH13 


0 


-8- 


40 


ESR2. 


0 


4 


20 


MGMT 


0 


4 


10 


SYK 


0 


. 4 


10 


HIC1 


10 


39 


90 


RASSF1A 


10 


23 


80 


GSTP1 


10 


12 


60 


MYOD1 


20 


. 27 


80 


CDH1 


20 


20 


90 



PTGS2 


a a 
JO 




i nn 


PGR 


.30 


40 


cn 

5U 


CALCA 


40 


£ A 

50 


£A 


HLAG 


ZTA 

60 


£A 
69 


1 AA 
100 


BLT1 


ZTA 

60 


o< 

OJ 


1 AA 

1UU 


ARHI 


1 AA 

100 


1 AA 
100 


1 AA 


MLLT7 


1 AA 

100 


1 AA 
100 


1 AA 


TFF1 


1 AA 

100 


1 AA 
100 


1 AA 


S0CS2 


A 

0 


A 
0 


AA 


SOCSl 


A 

0 


A 
0 


1f\ 
J\J 


TERT 


A 

0 


A 
0 


ICi 

jU 


DAPKl 


a 

0 


A 
0 


in 


TMP3 


A 

0 


A 
0 


OA 

ZU 


BRCA1 


a 

0 


A 

o 


OA 

zu 


GSTM3 


A 

0 


A 
0 


OA 
ZU 


MT3 


A 

0 


A 

0 . 


OA 
ZU 


TWIST 


A 

0 


A 
0 


1 A 


MLHl 


A 

0 


A 
0 


i n 


CYP1B1 


A 

0 


A 

u 


in 


TITFl 


A 

0 . 


A 

u 


1 n 


FGF18 


A 
0 


A 

0 




CDKN2A 


n.d. 


A 

n.a. 


A 

u 


HSPA2 


n.d. 


*i A 

n.d. 


A 

u 


PPP1R13 








B 


n.G. 


n.d. 


A 

u 


IrjjnrZ 


n a* 

n.u. 


11. u. 


0 


REV3L 


n.d. 


n.d. 


o • 


IGFB2 


n.d. 


n.d. 


0 



n.d., not done 

Table 4. Univariate analysis of methylation status in training and test sets. 



Genes 


Training Set 


Test Set 




. (N=24) 


(N=62) 




P Value 


P Value 


ESRl 


0.018 


0.555 


APC : 


0.002 


0.307 


HSD17B4 


0.056 




HIC1 . 


. 0.796 




RASSF1A 


0.042 


0.014 


RASSF1A/APC .. 


. <0.001 


• 0.007 - 


ESR1/APC : 


.0.001 


0.951 : 


ESR1/RASSF1A 


0.032 


. 0.138 



*P values for each variable were calculated by means of the log rank test. 



Table 5. Frequency of methylated genes according to clinicopathological features. 



Characteristics 


No. of 


ESR1 


APC 


RASSF1 


RASSF1 


Patients 






A 


A and/or 
APC 






% positive 


Size of tumour 












Tl 


64 . 


14 


11 


9 


. 19 


T2 


17 


12 


12- 


19 


31 


T3 + T4 


4 


75 


25 


50 


50 


Histologic type 








11 


22 


Invasive ductal 


55 


18 


15 


Invasive lobular 


10 


20 


0 


iU 




Others 


21 


10 


10 


10 


20 


Tumor grade 










21 


1 


38 


• 11 


11 


13 


2 




19 


16 


16 


31 


3 ' . 


10 


30 . 


10 


11 


11 


Lymph node metasta- 












ses 










18 


No 


58 


' 12 


9 


9 


Yes 


10 


20 


30 


20 


40 


Unknown 


1 o 

18 


28 


11 


22 


28 


Menopausal status 








11 


22 


Premenopausal 


18 


28 


11 


Postmenopausal 


68 


13 . 


12 


13 ■ 


22 


Estrogen-receptor 








16 


21 


Positive 


is 

JO 


16 


11 


Negative. 


o o 

38 




16 


11 


27 


Progensterone- 












receptor 








14 


18 


Positive 


42 


14 


5 


Negative 


34 


18 


24 


12 


33 


Hormone-receptor 












status 






t 




• 20 


Positive 


46 


15 


9 


15 


) Negative 


30 


17 


20 


10 


! 31 



Tumour grade was unknown in six cases. Hormone-receptor status was unknown in ten cases. 
Tumour size was unknown in one case. DNA methylation of RASSF1A for one case was 
missing.Chi 2 Pearson: Tumour size - ESR1 (P = 0.005); Tumour size - RASSF1 A (P = 
0.049); Progesterone-receptor - APC (P = 0,036); Median age - RASSF1 A methylated (79.0 
yrs.; 49.6 to 86.2), RASSF1A unmethylated (59.4 yrs.; 28.2 to 82.3.) P = 0,009 



Table 6. Results of univariate analysis. 



Variable. 

Size of tumour 
Tl 

T2 . 



No. of Patients 
Who Died/Total 
No. 

10/64 
. 5/1-7 



Relative Risk of Death 
(95% CI) 



2.2 (0.6-7.8) 



P 

Value 

0.018 



nr*o « hp A 

T3 + T4 


0/4 
Z/*f 


c a /n 7 - 47 Ctt 




Histologic type 


U/jJ 




V.ZyO 


Invasive ductal 






Invasive lobular 


1 /1 n 

1/1U 


0 A (Ci - ^ n 




Others 


3/21 


U.D J.U.1 - Z.1J 




Tumor grade 






n ^ 1 n 
U.o 1U 


1 


O/JO 






2 


QA9 
JZ 


9 WO 7 - 6 74 




3 


z/lU 


n mo 7 q\ 

1 .J ^U.Z - 1 .7) 




Lymph node metastases 






U.UUJ 


No 


//JO 






Yes 


^/1 n 


7 3 0 7-317") 




Unknown 


</1 c 






Menopausal status 






0.062 


Premenopausal 


1 /1 . 
1/1 o 






Postmenopausal 


1 A/AC 
lO/Oo 


9 (0 fi -49 4^ 


0.369 


E/Strogen-receptor 




1 0 f 0 fi - 5 <^ 


Positive 


THAR 




Negative 


o/io 




0.766 


rrogensterone-recepior 






rositive 


Q/49 


1 1 (0 3 - 3 2^ 




Negative 


nil a 




0 799 


Hormone-receptor 






status 








Positive 


1 0/4fi 


1 1 (0 4 - 3 51 




Negative 


f\ An 

0/jU 




. 0.370 


boKl metnyiation 






Unmetnyiatea 


11/79 


1.8 (0.5-6.7) 




Methylated 


Ah A 


0.001 


Ar C metnyiation 






T T . - _ _ 4.1. _ .1 ^X—. J 

Unmetnyiatea 


1Z/ /0 


5.3(1.3-21.3) 




Methylated 


^/1 fi 


0.001 


RAbor 1 A metnyiation 






unmetnylated 


1 1 1HA 
III IH 


6 9 H 8 - 26 5^ 




Metnyiatea 


fi/1 1 
O/ 1 1 


<0.001 


KAoorl/ArC metn- 






yiation 








Unmethylated 


7/66 


9-5(2.9-31.4) 




Methylated 


10/19 





Table 7. Multivariate Analysis. 



Variable 


Relative Risk of Death 


P 




(95% CI) 


Value 


Size of tumour 




0.19 


T2 (vs. Tl) 


2.7 (0.8-9.3) 




T3 + T4 (vs. Tl) 


2.9(0.4-20.5) 


6.039 


Lymph node metastases 


3.9 (1;1-13.9) 


Yes (vs. no lymph node metastases) 





•3? ~ • • « • m • • • •••• 

• • ••• • * • • 



UIlKIlOWn ( VS. nu lyjlipii ijajuc liiwiaoLa^^o^ 


5 2 fl 2-22 4^ 




/Vgc 


1.0 (1.0-1.1) 


0.06. 


RASSF1 A and/or APC methylated (vs. unmethyl- 


5.7(1.9-16.9) 


0.002 


ated} 








I.- 



• <• • • • * 
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Claims 



1. A method for determining the prognosis of a subject with a cell proliferative disorder 
of the breast tissues, said method comprising 'analysing the methylation pattern of a target 
nucleic acid comprising one or a combination of the genes taken from the group consisting of 
ESR1, APC, HSD174B4, HIC1 and RASSF1A and/or their regulatory regions by contacting 
at least one of said target nucleic acids in a biological sample obtained from said subject with 
at least one reagent, or series of reagents that, distinguishes between methylated and non- 
methylated CpG dinucleotides. 

2. : A method for selecting a treatment of a cell proliferative disorder of the breast tissues, 
said method comprising: 

a) determining the prognosis of a subject according to Claim 1, and 

b) selecting a suitable treatment according to said prognosis. 

3. A method for determining the phenotype of a subject with a breast cell proliferative 
disorder comprising 

a) obtaining a biological sample containing genomic DNA from said subject, 

b) analysing the methylation pattern of one or more target nucleic acids comprising one or a 
combination of the genes taken from the group consisting of ESR1, APC, HSD174B4, HIC1 
and RASSF1A and/or their regulatory regions by contacting at least one of said target nucleic 
acids in the biological sample obtained from said subject with at least one reagent, or series of 
reagents that distinguishes between methylated and non-methylated CpG dinucleotides, and 

c) determining the phenotype of the individual by comparison to two known phenotypes, a 
first phenotype characterised by hypermethylation of the target nucleic acid and poor progno- 
sis as relative to a second phenotype characterised by hypomethylation of the analysed target 
nucleic acid and positive prognosis. • 

4. A method according to claims 1 to 3 wherein said prognosis is the life expectancy.of 
said subject. 

5. A method according to any one of claims 1 to 4 wherein said target nucleic acid com- 
prises the gene APC and/or its regulatory regions. 



• « • • • 



6. A method according io any one of claims 1 to 4 wherein said target nucleic acid com- 
prises the gene RASSF1 A and/or its regulatory regions. 

7. A method according to any one of claims 1 to 4 wherein said target nucleic acids 
comprise the genes APC and RASSF1 A and/or their regulatory regions. 

8. A method according to any one of claims 1 to 7, wherein said target nucleic acid or 
acids comprise essentially one or more sequences from the group consisting of SEQ ID NOs: 
1 to 5 and sequences complementary thereto. 

9. A method according to claim 5 wherein the sequence of said target nucleic acid com- 
prises essentially of SEQ ID NO: 3. 

10. A method according to claim 6 wherein the sequence of said target nucleic acid com- 
prises essentially of SEQ ID NO: 5. 

11. A method according to Claim 7, wherein said target nucleic acid or acids comprise 
essentially of SEQ ID NOs: 3 and 5. 

12. A method according to Claims 1 to 1 1, wherein. said cell proliferative disorder of the 
breast tissue is selected from the group consisting of ductal carcinoma in situ, lobular carci- 
noma, colloid carcinoma, tubular carcinoma, medullary carcinoma, metaplastic carcinoma, 
intraductal carcinoma in situ, lobular carcinoma in situ and papillary carcinoma in situ. 

13. A nucleic acid molecule consisting essentially of a sequence at least 18 bases in length 
according to one of the sequences taken from the group consisting of SEQ ID NOs: 6 to 25 
and sequences complementary thereto. 

14. An oligomer, in particular an oligonucleotide or peptide nucleic acid (PNA)-oligomer, 
said oligomer consisting essentially of at least one base sequence having a length of at least 
10 nucleotides which hybridises to or is identical to one of the nucleic acid sequences ac- 
cording to SEQ ID NOs: 6 to 25. 





15. The oligomer as recited in any one of Claims 13 or 14 , wherein the base sequence 
includes at least one CpG dinucleotide. 

« 

16. A set of oligomers, comprising at least two oligomers according to any of claims 1 3 or 
14. 

17. A set of oligonucleotides as recited in Claim 16, characterised in that at least one oli- 
gonucleotide is bound to a solid phase. 

18. A set of at least two oligonucleotides as recited in any of claims 14 or 15, which is 
used as primer oligonucleotides for the amplification of nucleic acid sequences comprising 
one of SEQ ID NOs: 6 ..to 25 and sequences complementary, thereto. 

19. Use of a set of oligonucleotides comprising at least two of the oligomers according to 
any of claims 16 to 18 for detecting the cytosine methylation state and/or single nucleotide 
polymorphisms (SNPs) within the sequences taken from the group SEQ ID NOs: 1 to 5 and 
sequences complementary thereto. 

20. A method for manufacturing an arrangement of different oligomers (array) fixed to a 
carrier material for predicting the responsiveness of a subject with a cell proliferative disorder 
of the breast tissues by analysis of the methylation state of any of the CpG dinucleotides of 
the group SEQ ID NOs 1 to 5 wherein at least one oligomer according to any of the claims 14 
or 15 is coupled to a solid phase. 

21. An arrangement of different oligomers (array) obtainable according to claim 20. 

22. An array of different oligonucleotide- and/or PNA-oligomer sequences as recited in 
Claim 21, characterised in that said oligonucleotides are arranged on a plane solid phase in the 
form of a rectangular or hexagonal lattice. . ■ 



23. The array as recited in any of the Claims 21 or 22, characterised in that the solid phase 
surface is composed of silicon, glass, polystyrene, aluminium, steel, iron, copper, nickel, sil- 
ver, or gold. 



24. A DNA- and/or PNA-array for predicting breast cell proliferative disorders 1 response 
by analysis of the methylation state of any of the CpG dinucleotides of the group SEQ ID 
NOs: 1 to 5 comprising at least one nucleic acid according to any of the claims 14 to 18. 

25. A method according to any one of Claims 1 to 4 comprising the following steps: 

a) obtaining a biological sample containing genomic DNA, 

b) extracting the genomic DNA, 

c) converting cytosine bases in the genomic DNA sample which are unmethylated at the 5- 
position, to uracil or another base which is dissimilar to cytosine in terms of base pairing be- 
haviour, 

d) amplifying at least one fragment of the pretreated genomic DNA, wherein said fragments 
comprise one or more sequences selected from, the group consisting of SEQ ID NOs: 6 to 25 
and sequences complementary thereto, and 

e) determining the methylation status of one or more genomic CpG dinucleotides by analysis 
of the amplificate nucleic acids. 

26. The method as recited in Claim 25, characterised in that Step e) is carried out by 
means of hybridisation of at least one oligonucleotide according to Claims 14 or 1 5. 

27. the method as recited in. Claim 25, characterised in that Step e) is carried out by 
means of hybridisation of at least one oligonucleotide according to Claims 14 or 15 and ex- 
tension of said hybridised oligonucleotide(s) by at least one nucleotide base. ■ 

28. The method as recited in Claim 25, characterised in that Step e) is carried out by 
means of sequencing. 

29. The method as recited in Claim 25, characterised in that Step d) is carried out using 
methylation specific primers. 

30. The method as recited in Claim 25, further comprising in step d) the use of at least one 
nucleic acid molecule or peptide nucleic acid molecule comprising in each case a contiguous 
sequence at least 9 nucleotides in length that is complementary to, or hybridises under moder- 
ately stringent or stringent conditions to a sequence selected from the group consisting of 
SEQ ID NOs: 6 to 25, and complements thereof, wherein. said nucleic acid molecule or pep- 
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tide nucleic acid molecule suppresses amplification of the nucleic acid to which it is hybrid- 
ised. 

31. The method as recited in Claim 25, characterised in that Step e) is carried out by 
means of a combination of at least two of the methods described in Claims 26 to 30. 

32. The method as recited in Claim 25, characterised in that the treatment is carried out by' 
means of a solution of a bisulfite, hydrogen sulfite or disulfite. 

33. A method according to any one of Claims 1 to 12 comprising the following steps: 

a) obtaining a biological sample containing genomic DNA, 

b) extracting the genomic DNA, 

c) digesting the genomic DNA comprising one or more of the sequences from 
the group consisting of SEQ ID NOs: 1 to 5 and sequences complementary thereto with one 
or more methylation sensitive restriction enzymes, and 

d) determining of the DNA fragments generated in the digest of step c). 

34. A method according to claim 33, wherein the DNA digest is amplified prior to step d). 

35. The method as recited in any one of Claims 25 to 32 and 34, characterised in that more 
than six different fragments having a length of 100 - 200 base pairs are amplified. 

36. the method as recited in any one of Claims 25 to 32, 34 and 35, characterised in that 
the amplification of several DNA segments is carried out in one reaction vessel. 

37. The method as recited in any one of Claims 25 to 32 and 34 to 36, characterised in that 
the polymerase is a heat-resistant DNA polymerase. 

.38. The method as recited in any one of Claims 25 to 32 and 34 to 37, characterised in that 
the amplification is carried out by means of the polymerase chain reaction (PCR). 

• 39. The method as recited in any one of Claims 25 to 32 and 34 to 38, characterised in that 
the amplificates carry detectable labels. . 



40. - The method according to Claim 39, wherein said labels are fluorescence labels, radio- 
nuclides and/or detachable molecule fragments having a typical mass which can be detected 
in a mass spectrometer. 

41 . The method as recited in any one of Claims 25 to 32 and 34 to 40, characterised in that 
the amplificates or fragments of the amplificates are detected in the mass spectrometer. 

42. The method as recited in any one of the Claims 40 or 41, characterised in that the pro- 
duced fragments have a single positive or negative net charge for better detectability in the 
mass spectrometer. 

43. The method as recited in any one of Claims 40 to 42, characterised in that detection is 
carried out and visualised by means of matrix assisted laser desorption/ionisation mass spec- 
trometry (MALDI) or using electron spray mass spectrometry (ESI). 

44. The method as recited in any one of the Claim 1 to 25 or one of the Claims 25 to 43, 
characterised in that the genomic DNA is obtained from cells or cellular components which 
contain DNA, sources of DNA comprising, for example, cell lines, histological slides, biop- 
sies, tissue embedded in paraffin, breast tissues, blood, plasma, lymphatic fluid, lymphatic 
tissue, duct cells, ductal lavage fluid, nipple aspiration fluid and combinations thereof. 

45. A kit comprising a bisulfite (= disulfite, hydrogen sulfite) reagent as well as oligonu- 
cleotides and/or PNA-oligomers according to any one of the Claims 14 to 15. 

46. A kit according to claim 45, further comprising standard reagents for performing a 
methylation assay from the group consisting of MS-SNuPE, MSP, Methyl light, Heavy 
Methyl, nucleic acid sequencing and combinations thereof 

47. The use of a method according to any one of claims 1 to 12 and 25 to 44, a nucleic 
acid according to claim 13, of an oligonucleotide or PNA-oligomer or a set thereof according 
to any one of claims 14 to 18, of a kit according to Claim 45 or 46, of an array according to 
any one of claims 21 to 24 or of a method of manufacturing an array according to . claim 20 in 
the prognosis, diagnosis, treatment, characterisation, classification and/or differentiation of 
breast cell proliferative disorders. 



Epigenomics AG 
E30120 . ' 

Abstract 

The present invention relates to modified and genomic sequences, to oligonucleotides and/or 
PNA-oligomers for detecting the cytosine methylation state of genomic.DNA, as well as to a 
method for the prognosis and treatment of a cell proliferative disorder of the breast tissues. 
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Sequence listing 

<110> Epigenomics AG 

<120> Prognostic and diagnostic markers for breast cell proliferative 
disorders. 

<160> 25 

<210> 1 

<211> 3501' 

<212> DNA 

<213> Homo Sapiens 



<400> 1 

gcggggctgg 
atcggctaag 
acgggccgcg 
aggggagctc 
ctgcgagccg 
tttccccaag 
gaggcgccgg 
gcccgggtca 
tcccctggga 
ccccgatata 
ccgctgcctc 
cggccagggc 
tggggcaact 
ggtaaagttc 
actgtctcca 
gccaggtgcc 
tcctgctcct 
gtccttcgcc 
gtggagggga 
ggccggtgtg 
cgcccggcca 
tgtgcgacgt 
cggccagcag 
atgacatggt 
tggctgacgg 
tgggcgccgt 
agaaacgcct 
gcggctacgc 
aggcctgcta 
gcccagaggc 
ccgccgcact 
agaagagccc 
cgcgcccgga 
tgccgtcgct 
catttcgcgg 
gtctcctcta 
-gccgggagcg' 
cgcccggggg 
ggcccggcgc 
gcgaggaccc 
atggcgagcc 
tccccagctc 
tgtacggcag 
ttggaggcgg 
cctaccgctg 
agaagacgca 
agcgtgggac 



caggggcgct 
agctgccacc 
gggccccgcg 
agggctcggc 
agggcctggg 
tttggggcgg 
gcccgcgcgt 
gggccgcagc 
gctgcgtggc 
acgccctccc 
cgctccccgc 
ggcgccaggg 
tctcccgagg 
tccgccctga 
aaagggtcac 
ctggggcgtg 
tctcctggtc 
ggtgcccagg 
gaagtgccgg 
tgtccccgca 
ctccaggcag 
gatcatcgtg 
cgcctacctc 
gagcccggcc 
cgcagaggcg 
gctggccgcc 
caagcgccac 
gccctatggt 
cccgtcccca 
cgc$gtcaac 
ctgtgcctcg 
gccgggctcc 
cagccctccc 
gccgccgctg 
cggcagcggc 
tcgctggatg 
' cggctccccc 
gcccccgctc 
gggcggcgac 
cagcccgcct 
cgagagcttc 
tgagcagctg 
ggccgaggcg 
cggggacaag 
cgcgtcgtgc 
ctggctgacc 
catgacgcgc 



gccctggcac 
gccgcgggga 
ggccaggagg 
tccgggctct 
gccggcgcac 
cggagttccg 
gtagggccca 
cggctgtgcg 
tcccccctcc 
cgcgccgggc 
gcggccgccg 
.cgggcaccgc 
cgggaggcgc 
atgacttttc 
tgcgcctgaa 
caggccgccc 
cgggcgggcc 
ccgcagggct 
gctggggcca 
ggagagtgtg 
ctgctgctgc 
gtgcagaacg 
aagtccctgg 
gtgttccgcc 
gctgcggccg 
gccagctacc 
ggcaagtact 
cggccgggcc 
gtcgggcctc 
acgcactgcg 
gagcgccgct 
gcggcgccag 
agcgccggcc 
cccttccaga 
agcccgggac 
aagcacgagc 
agcgagcgct 
ggcctggcgc 
ggcgacgact 
ggcggccacc 
ggtgacaacc 
aacgcgcacg 
gccgaagtgg 
gtcgccgggg 
gacaagagct 
cggccctacc 
cacatgcgca 



agctcggggc 
ggggagcccg 
ggaacggggt 
gccgccggat 
tcctcccgcc 
ggggagaagg 
ggccgaggcc 
ccgtgcccgc 
cccccacctg 
ccggccttcg 
cccgggcccc 
gctcccctcc 
tggttcctcg 
ctgaagcgga 
cagttttctt 
tggc.ctcccc 
•ggcctgggct 
gatgcccccg 
ggcggccagg 
ctg'ggcagac 
agctcaacaa 
ccctcttccg 
tggtgcatga 
tggtgctgga 
cggccgtggc 
tgcagatccc 
gccacctgcg 
ggggcctgcg 
cgccgccgcc 
ccgagctgta 
gctcccctct 
agcggccgct 
ccgccgcct'a 
agctggagga 
ccgagccccc 
cgggcctggg 
gcgaagagcg 
cgccgccgcg 
acaagagcag 
tcgagggcta 
tgtacgtgtg 
tggaggctca 
ccgctggggc 
ctccgggtgg 
acaaggaccc 
catgcaccat 
gccacctggg 



ctggcagcgg 
gcccgccggg 
cgggcgggcg 
ttgggggccg 
ctgtctgcag 
ggccggggga 
gggacgcggg 
ccggggcgct 
cttcctgcct 
cgctctgccc 
gaccgagggt 
tccgtatcac 
gctccctttc 
cattttactt 
ctcggaagcc 
tccaccggcg 
cccactccag 
ctcagctgag 
gcgccgcacg 
gatgctggac 
ccagcgcacc 
.cgcgcacaag 
caacctgctc 
cttcatctac 
cccgggggct 
cgacctcgtg 
gggcggcggc 
ggccgccacg 
tgccgcggag 
cgcgtcggga 
ttgtggcctg 
ggctgagcgc 
caaggagccg 
ggccgcaccg 
cggccgcccc 
tagctatggc 
tggtggggac 
ctaccctggc 
cagcgaggag 
cccatgcccg 
cattccgtgc 
cgtggaggag 
cgccggccta 
cctgggagag 
ggccacgctg 
ctgcgggaag 
cctcaagccc 



cgggtggggc 
accgcaggta 
agcagcgggc 
cgagga'agag 
ttggaaaact 
gccgcggagg 
tggggcgcag 
gccccctccc 
cagcctcctg 
gccacggcag. 
tgacagcccc 
ttcccccaac 
tccctacttg 
aaatcgggta 
ccagcaccca 
gccgctcacc 
agggcagccg 
ggaaggggaa 
gctctcacce 
acgatggagg 
aagggcttct 
aacgtgctgg 
aacctggacc 



accggccgcc 

gagccgagcc 

gcgctgtgca 

ggcggcggcg 

ccggtcatcc 

ccgccctcgg 

cccggcccgg 

gacctgtcca 

gagctgcccc 

cctctcgccc 

ccttccgacc 

gacgggccta 

gacgagctgg 

gcggccgtct 

agcctggacg 

accggtagca 

cacctggcct 

ggcaagggct 

gaggaagcgc 

gggccccctt 

ctgctgcggc 

cggcagcacg 

aagttcacgc 

ttcgcgtgcg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
'660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
• 1740 
1800 
1860 
1920 
1980 
2040 
.2100 
2160 
2220 
2280 
' 2340 
2400 
2460 
2520 
2580 
2640 
2700 
"2760 
2820 
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acgcgtgcgg 
cgggcgagaa 
■tcatcagcca 
cgggcttggg 
ccgagggcgt 
acaaggcggc 
tggcgctgga 
acaaggcggc 
ccatcgaccg 
gcggccctgg 
ggacaaccgc 
ctgcttcgtg 



catgcggttc 
gccctacgag 
catgaagatg 
ggggctcccc 
ctttgctgtg 
cgcggccgag 
gagcctctac 
cgaggtgctg 
tttctctccc 
cccgcacccc 
agcgtcgcca 
ccttagctcg 



acgcgccagt 
tgccaggtgt 
cacgccgtgg 
ggcgtccccg 
gctcgcctca 
ctgctggcgc 
ccgctggcca 
agccagggcg 
acctagagcg 
agggagcggc 
cagtggcggc 

g 



accgcctcac 
gcggcggcaa 
ggggcgcggc 
gccccgacgg 
cggccgagca 
agaccacgca 
agttcacggc 
ctcacctggc 
cccctcgcca 
gggggcggcg 
tccacctctc 



ggagcacatg 
gttcgcacag 
cggcgcggcc 
caagggcaag 
gctgagcctg 
cttcctgcac 
cgagctgggc 
ggccgggccc 
gcccgctctg 
cgcagggccc 
ggcggcctca 



cgcatccact 
caacgcaacc 
ggggcgctgg 
ctcgacttcc 
aagcagcagg 
gaccccaagg 
ctcagccccg 
gacggccgga 
tcgctgctgc 
actgtgccc'g 
cctggcctca 



<210> 2 

<211> 2501 . 

<212> DNA- 

<213> Homo Sapiens 



<400> 2 

tttccatagt 

cagcacaaaa 

tctttctccc ' 

gggtcgccac 

ggatagggtc 

ggctgcaccg 

actcaggaga 

agatcattta 

aagcagtggg 

gcagggatta 

gctatctctg 

aaacccagaa 

gtgtcgagtt 

aaatggcccc 

ggattccaag 

gaccctcgtc 

gctctgcttg 

cgacgggcgg 

ccgcgcccct 

gcagctgagg 

atctcttgag 

tccccagcac 

ctgctgattg 

tactcttcct 

cgcgaggggc 

tcattcggtc 

cttctttaaa 

gtcacaatct 

acaggggtca 

ctggagagca 

gcggcaaaat 

aacaaaccat 

agtgggtgac 

ccactggtag 

taaacgttac 

taaagtggct 

attttatacc 

ccttctaaat 

attttacaac 

gaggtccagg 

ggggctatat 

cttacacatg 



gtaaatgtgt 
ttcaaacatc 
caaacctagc 
tttaggtcct 
tgtttatttt 
ttattgaaag 
tgctcagtaa 
ggaacacttg 
ggtccaggac 
ttacactgac 
gaagcccctc 
aacgtttatt 
aactattctt. 
tggaaggcaa 
ggagccgcgc 
ccgcccccgc 
ttcgtgtgtg 
gtggtactgg 
tgctgaggcg 
tcaccccgct 
gaggaaagag 
cccggtgtgg 
caaaactggg 
gcaattaact 
gggggctgga 
gttcagtgtt 
gagggcgtct 
gttctcgaaa 
ggcgtatgat 
gagaattccg 
ggaaaattgt 
tcataccaac 
ctctgtaacc 
taccttatac 
aaactggaat 
.tcttatgctg 
.catagctgtc 
gtttttagga 
gcttggtgtt 
gattacattc 
ctgaaaatca 
gtatgcagta 



tcccaccact 
taaacatgat 
tagtgtttgt 
ggtcctcctt 
gacaccatta 
tttaccatct 
atgttgattg 
tcaaaatgtt 
tctgcatttt 
aaatctttac 
aaggataggg 
gagtgaatga 
ctacacaagt 
gtcttccagc 
gctctgcgct 
cattccccgc 
tgtcgttgca 
tcaccggcgc 
cagctggctg 
gaggtggtgg 
ccggaaacac 
gcttcccaag 
tgaaagttct 
ctcttagatc 
attcgccccc 
tgtttccctc 
gataagagcc 
tccactggag 
ggagaaagaa 
aagtggaccg 
tatgtgaaat 
atagatgctg 
tccaagtgca 
ctggaaaggc 
tctgtcaatt 
.cctttcagaa 
tggttat.ctc 
aaacattttg 
ttaaatcttt 
taagacgttt 
ttcaaggata 
cattttacac 



ctctggagta 
cttgctggct 
gctgcctgta 
cagaaagttt 
gctcacttac 
gactctgagt 
aactatgact . 
tttgcccctt 
gacagtcccc 
catccctaag 
gccgcatgct 
tgaaacgaca 
cctagcagct 
gagtggagtc 
catcttccta 
ctcctcctgt. 
ggccttattc 
gggggcaggt 
ctcttttcgg 
ggaggggaat 
ctggtctctc 
gtcctgcctg 
ccctgaccct 
tttgcctagt 
tgaagcgcag 
tgtcgtaaga 
cttccccgtt 
tcttggcaga 
aatgggagta 
gccagtggga 
agccccattt 
tgcagtgaga 
gggatcttga 
cctaatgcat 
aattcctatg 
aatgctttca 
tgcatgggca 
tttacacttt 
tttgacagct 
.ttgccatcgc 
•gggactgctt 
cagtactcag 



atcctactta 
ttgcttttgt 
atgcccttct 
ttcctctttc 
acacattggt 
agcttgagga 
gttctcaaca 
gagattctat 
tgatgtttgc 
ggggactttc 
gtttctctag 
ggtgaataga 
cccattgctt 
actcttaact- 
ccagaaitcg 
cccgcagtcg 
atgggctcac 
gagcatgcga 
gccggcatac 
ggttattctt 
aagcaggtac 
agaggagagg 
tatctgtggg 
cttttaaagg 
agatgtcagc 
ttttaagttc 
ggagtttgta 
ggttgtaagc 
ggatgggcac 
aagttgcctg 
tttaaagtac 
ttttacatta 
cattatgcac 
gaattatttg 
tactttcata 
gtgttgatga 
tgtatttggg 
gcttttattg 
cttggataat 
taaggagact 
cttttgacac 
t 



aaaccgtttt 

ggctttaccc 

ttctttgcag 

tccccagcgg 

cacaagtcta' 

tcctatcaaa 

tacaaacgca 

tttgggaggt . 

atgtagaagt 

cttcccaggg 

gtcagcaact 

tgaacgcaag 

ccagccgcag 

acatttccca 

gcaagtcact 

gcgtccagcg 

cgctgaggtt 

aggttggagg 

gcgcgcagcc 

gaggcaccgc 

agcccgcttc ■ 

ccaggctggg 

catcgattgt 

actgaaaagc 

tcctgaaaag 

gtgagaggac 

tgcttagcaa 

tcaaatgcgc 

atctgaggaa 

tatttcagga 

aaaaaattaa 

gtttctcacc 

ctttgattct 

agttatatat 

tc'tgtattga 

atagccaagt 

tgtagtcata 

taaataatgt 

tttcatgcag 

ttccttttca 

cattagcata 



2880 
2940 
3000 
3060 
3120' 
3180 
3240 
3300 
3360 
3420 
3480 
3501 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
•1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
• 2340 
2400 
2460 
2501 
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<210> 3 

<211> 2470 

<212> DNA 

<213> Homo Sapiens 



<400> 3 

aaagatgatt 
agggtacttt 
■ ggttataact 
ttttctctta 



gagaaatttt 
t'tataatgtg 
. tcactcctgt 
accccttgcc 
aagaggttat 
. ttttagcata 
ttgtaagtgt 
atgaccaggc 
gtgatccctg 
actgtagatt 
tggtagagtc 
gtctgcctta 
gtttaagtat 
aggcaggatt 
cctgaatctt 
tcctcttacc 
taactgccag 
tctttaggta 
agcataggct 
cttatttacc 
tccattttgt 
tggaggatgt 
tcattaatcc 
agtattacta 
aatgggtaaa 
tctgtgagaa 
tttatcttga 
tgttttgaga 
gtttccttta 
aaaatccttt 
tcagttgtta 
agaagataat 
aagactgtga 
tgaggtaaga 
tcttgcaaaa 
atataaagaa 
taaagagaag 
tagaaactag 



aaaagtttaa 
tagcagtctc 
tctgtgtctc 
ttttgtttta 
aaggttcaca 
ctgggtattt 
ggatttttcc 
cattgatcag 
gttacatgta 
ccttttttta 
tcatttacaa 
ctgaaattaa 
cctgcaggac 
ctagccgttg 
ttccaattgc 
ttttgtgtaa 
tgcttaagtg 
ttatttgttc 
acttggttaa 
tggctatctt 
actctgcttc 
tcttttggcc 
attatagtca 
atcttcctat 
taaatgattg 
gtaagattag 
attctggctc 
atataaataa 
.atatctttaa 
ggcaaatgta 
actaagactc 
ccaaaaacta 
cccctttctt 
ttaaccttat 
aagcaagttg 
tccaatcatc 
cttttaattg 
cactttactt 
gttagcattt 
aataagtgta 
agaccaaata 



ttgttcatct 
tgctcatctt 
agtaaacttg 
ttattttgat 
ttattcaaat 
tcagacctgc 
tttatttttg 
a'ggaggggtc 
tattttcata 
ttaatcctaa 
attgtgaaat 
cacacaagac 
tccccttcct 
tagaactaac 
tgaactccaa 
tggatttcat 
caaactactt 
caaaatgatt 
• tcttaatata' 
ttgtatcttc 
atttctcttt 
tcagtttgag 
accttttgaa 
caaaaaggct 
ccttttaatg 
ccttttatca 
taaaattctg 
atttttatca 
aaccttgaag 
ttcagacaca 
cactgtttca 
gcatattaac 
taaaaacaag 
aggtccaagg 
aggcactgaa 
ttacaaaact 
tagtttatcc 
aaaagtgtat 
atatttttaa 
aaactccaat 
tagcatcact 



gaagagttga 
gcccatccgg 
tgcatgccca 
gtaaaattac 
tctgtcagat 
ttattaaaaa 
agatgtctcc 
ttaactatgg 
atataactta 
•taatatcact 
gaattaaaat 
tcaatttttt 
cctcagatgt 
tagatctaag 
tattgtcgtg 
attacagagg 
aatacttttt 
tgacctaaac 
actcctagca 
tttgtctctc 
gacagtctct 
cacagcagat 
taaatgtgat 
taaagtcttc 
acatcttaga 
aataaaaagt 
tgactatcag 
taattatatt 
aaattattac 
actaaagttc 
tcctcttaga 
acaattcttc 
cagccactaa 
gtagccaagg 
gatggagaac 
ggaaactgag 
atttttattc 
tttaaattaa 
ataagatata 
ggctagttag 
gtactgagg'6 



tttttttatt 
ctctttttgt 
tttttttctc 
ctgttaattt 
ccctacctct 
gatgtaaaac 
tttggctgca 
gtgaacccta 
catttacata 
•gtaagttatg 
gaaagggcaa 
tcaaccaaag 
cattggattg 
atgagtcccc 
actagccagt 
cattttttta 
agctattaag 
taaaaagaga 
ttctataatt 
ctcttctttc 
actcctaagg 
cccaagacca 
tgaactttat 
atttaatgct 
acttcagaac 
gtgaaatgga 
ataaaattca 
tcctaagttt 
ttgatagaaa 
tctcttctat 
tgctgctact 
ttaaacgtct 
attttttagt 
atggctgcag 
tcaaatcttc. 
gcatctaata 
agtattccct 
gcaataatat 
ttgaattcat 
ttcttagttc 
aaggttttct 



cctgtaataa 
ggttgtgtaa 
tgttactacc 
tatttgaaat 
gtcatatggt 
aaaataatga 
ttacttcttc 
tatcttactg 
gtacttttat 
ttgaagcaga 
agattaaatc. 
acttttgtag 
taccaggttt 
tgatttcctt 
gttacaacct • 
atgtcaagat 
taattaagat 
atgtggatct . 
cttcctaaag 
ccagtcataa 
tcatccattc 
catatgccat 
gttagtaatt 
ctccttcatg 
tatttcacca 
atatgtaatc 
gaaataaaat 
tgcctgtaag 
gtttaatcca 
tttaatttca 
tgaacaatat 
taagagtttt 
agtgaatttc 
cttcatatga 
gacaagagct 
tgaaggtatc 
cttgtaaact 
gtaaactctt • 
tcagtgaatc 
tttttaagat 
gtgtagttca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
. 1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800- 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2470 



<210> 4 

<211> 7001 

<212> DNA 

<213> Homo- Sapiens 

<400>' 4 



aatgcaatgg 
gtgtggaagg 
ctggagagac 
gaatggtgac 
ctggagacac 
cagtaagtca 



aaaaagagag 
caagggaaaa 
agtgatgctg 
tcatatttga 
attcaacgga 
cggctggaac 



attgtaaagc 
tcagccctcg 
ggcacagaca 
acaagcctgc 
ggagccagat 
gtgtgtgtcc 



tagaaggctt 
aagaagacag 
cggggaagtt 
aatgcccagc 
caatctttac 
agcaggagag 



aggaattgcc 
tgagatttta 
gagaggaaca 
agaccgctgg 
ccttc'ttcac 
ggtagggagg 



tcttgattag 
atctgggtgg 
ccatgtttga 
aaaagtgggg 
ctgagagagc 
gaagccaaga 



. 60 

120 

180 

240- 

300 
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gagctgggag 
gtatactttc 
aagagaatgc 
ctttgattgt 
ttcctatgtc 
tcctctctcc 
tggccattgt 
ttcaatattt 
gtgctcaaac 
ggcatcatta 
cataaatctc 
gtgattagag 
gggtgtcatg 
gttagtacta 
gccactggga 
atcagtttaa 
tacgtatttt 
gggtttggtt 
ccaaaacatt 
gttcagagac 
tttattgtta 
atccatacac 
tgtttttcct 
gtaggggcaa 
ctgctccctg 
tgtgcagtag 
tgcttcccga 
tcacacactg 
cccttgactt 
aggcttgttt 
tggctagtgt 
tgtgtctcct 
" caggttcaca. 
ttgaaaattg 
gtaatgattg 
ttatttgctg 
gcatattgtc 
aaatagtcaa 
acttcgtcag 
gtatcagagt 
attatagacg 
tatatatat'a 
aattcagtta 
tagcccaagt 
attttaaaaa 
ttcgtgacct 
gttttggttg 
gtttttctga 
ccaacataaa 
actttgggag 
ggcaaaatgc 
ccccctgtca 
cgcacacccc 
tatgtacaca 
aacagagata 
cccctggatc 
gatcctcgtg 
aagtaaagta 
gagtgatgtt 
ttgtaatgca 
gctggcggag 
cggacccgca 
gctgcgtcgc 



cccgagtgaa 

ccacccatgc 

tggagagaaa 

aagtctctga 

aatagtgaac 

tccctccctg 

tgacctacag 

atttatatcc 

actgcaatac 

acatttttat 

tcagtaactg 

tcatttaata 

tgtcctggga 

tggccgagag 

aatgagagac 

aacaaaggta 

tcacatgttt 

aacgaagtgg 

tattccaaag 

tgggggctag 

aattatattc 

ttttgactgg- 

cttgct'tggg 

aggg.gctggg 

tgagcagaca 

cccagctgcg 

atccctgcca 

agccactcgc 

ctacaagccc 

tgatttctct 

ctgtatccta 

ttttctttca 

ttttttaaga 

aaagtgtgat 

ctggccataa 

acaatctata 

caacaatggt 

aaatgtaact 

atactctgtt 

ctattaggct 

catgatatac 

tatatacaca* 

aaaacttttt 

gaaccgagaa 

tctgttagct 

gaggttatgt 

cataaccaac 

atcatccttc 

cacacaagtc 

cattttgcag 

agctcttcct 

gttcccccag 

attctatctg 

ctacgtattt 

tatcggagtc 

cgtctttcgc 

cgcccccgcc 

aagttcaggg 

taagccaatg 

tatgagctcg 

ggcgttcgtc 

ggctcccggg 

ctctaacctc 



gtttttgcca 
tcaccaagcc 
gtggttaaga 
acaatcaaag 
aattgcagat 
aataetcttg 
gaggcaccac 
agtatttatt 
tgggggtctt 
ggctttctat 
catagtgtcc 
tccaaggtgg 
ccctgagcgt 
aatagttgag 
ctcgttccca 
tctaccttta 
acagaaagca 
aggagtatta 
tctggtttcc 
ggccagtaag 
tttcggaaaa 
cattctattt 
gtggcagcag 
gtttctcctc 
gcaagtctcc 
tgtctgccgg 
ttccacgcac 
acatgcgagc 
atggaacatt 
ctctgtagct 
gcagggagat 
ggacttgtag 
gccctatgaa 
ttatggaaat 
agggaaatat 
gtcattaatg 
ctatttttgt 
cttagtaaca 
gggtgtttgg 
gcagcaaagg 
ttcacctatt 
tatatgtgtg 
tctatttgtt 
gatcgagttg 
ggaccagacc 
ttggtatgaa 
ctgtggaagg 
acatgagaat 
aggctgagag 
aggaagaaac 
atatgtatac 
ctgctaaata 
ccctatctcg 
ctagccaacg 
tggcacgggg 
• gtttatttta 
ccctggccgt 
aagctgctct 
tcagggcaag 
ggagaccagt 
ctgggactgc 
gcagggccgg 
gggctgtgct 



aaggcagaag 
cagggacaag 
aaactgcctt 
cctgtgagga 
cccctttcct 
cttttttctg 
tgtcacc'aac 
ttcaatactg 
tcaaagcaca 
ttcttttttg 
cagggccaga 
tgactaatgt 
gggcactcta 
aaagtggtca 
atcacggtca 
tcttatgttc 
gtcaactgag 
catttcagct 
tggtgcaatc 
g'catttgatc 
accacccatg 
tagccgtaag 
aaagcatagg 
cccagtacag 
cctcactccc 
gaggggctgc 
aaacacatcc 
acattccttc 
tctggaaaga 
ttagcatttt 
gaggattgct 
gattctttgt. 
gtgctttttg 
taaatcatct 
ctgcgatgca 
ctaaactcga 
aagaattaga 
gtaagcttgg 
gatagcaatt 
aagtttattc 
.ttttgtctcc 
tgtgtatgt'g 
tttcatctgg 
taggactaaa 
gacaatgtaa 
aaggtcacat 
catgaacacc 
tcctaatggg 
aatctcagaa 
tgaggtcctg 
cctgaatctc 
tagctgtctg 
gttacagtgt 
aggaggggga 
cacataaggc 
agcccagtct 
gaaactcagc. 
ttgggatcgc 
gcaacagtcc 
acttaaagtt 
acttgctccc 
ggccagagct 
ctttttccag 



aggaaagtcg 
gctcaccaag 
tactgaactt 
gacagctaac 
ttccttctcc 
ggactggtct 
aaagggtaac 
actatggaga 
aaaacatata 
tactggtctc 
gaccggccac 
ctggcaacaa 
ggagcacctc 
agaggtggat 
gtgcaactcg 
atatcctagg 
ctattcatgg 
ggaaacacat 
ggaggcatgg 
cacatgtatc 
tcctattttg 
actatgattc 
gtactttcca 
ctttctctgg 
cactgccatt 
caagtgccct 
acacactctc 
cttccttctc 
cgttcttgat 
gagaaagcaa 
gttctccatg 
gccatttgca 
catgtgtttt 
gtaaaaaatt 
cctaatgtgt 
ttttggcttc 
taaaatgtat 
catttagata 
aaaacaaagt 
aaaagtataa 
ttaatatgta 
cgtgtgcatg 
atatttgatt 
ggatagacat 
cataattgcc 
tttatattca 
catgtgcgcc 
accaagtaca 
ggttgtggaa 
gcaggttgca 
cgcccccttc' 
•tggctggctg 
agtcctcccc 
atcaaacaga 
agcacattag 
tccctgggcc 
ctctatccag 
tccaaatcga 
ctggccgtcc 
ggaggcccgg 
gtcgggtcgc 
cgcgtgtcgg 
gtggcccgcc 



gcgtagcaca 
atgagtttgg 
cttgggctaa 
cttcttattc 



tttcccctgt 
agagcatggg 
agtctttctt 
gagctctcct 
tttgcatgat 
aagagccact 
tcctggcatt 
agcctccatt 
agtattgcgt 
ccatgtgaac ■ 
aaagcctaaa 
cttttaataa 
aaaggtttgt 
•ccctagaatg 
caatgcctct 
ccagaaggct 
taaacttgat . 
acagcaagcc 
gcctccaagg 
ctgtgccaca 
catccagcgc 
gcctactggc 
tctgcctagt 
actctctcgg 
ccagcagggt • 
cttacctttc 
ggggtatgtg- 
tataatttgg 
aaaaaggcat 
gctttggaaa 
tttt.aaccct 
agctacattt 
acttgatata 
gaccatgaac 
attgatagtt 
actatccaag 
tatatatata 
tttaactttt . 
ctgcatatcc 
gcagaaatgc 
aaagctttgg 
gttttctgaa 
ctaaccaaag 
gtactgtggt 
gggtctatct. 
ttctcctgat 
ccctcagatg 
cgtatgcaac 
agggtcatcc 
aagagagaca 
agaaagccgg 
acctttagca 
cagcgacgac 
gttgtgcctg 
tccagcacct 
gagcccagga 
ccggcttcac 
cgggac'atgc 
ggtttctgag 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280' 
2340 
2400 
24 60' 
2520 
2580 
2640 
2700 
2760 
2820 
' 2880 
2940 
3000 
3060 
3120 
' 3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
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ccttctgccc 
ccctccacac 
agcccctgaa 
tggacagcag 
ccgcggccgc 
ctgaggctgc 
ctccgagccc 
acggccagca 
ccggcccgcc 
gcccggcagg 
cgcgggacgc 
ggttctgcgt 
aaacgtactc 
ctaacaaaac 
tccaggcaaa 
aaagttgggg 
ttttgtttaa 
agctatgatg 
cccacgtcct 
ctggctgagt 
taggtgtccc 
ctgccagccc- 
tcagccaggg 
• gagaccactt 
acaagaattg 
tttgtttttt 
gtgaacacct 
agggcgccac 
ctggtgtccc 
tttccaggga 
actgaagggc 
gagcatccca 
aacttttaaa 
agagaaaaag 
gattaagtct 
gatgtgaatt 
tcagttcttt 
cttggtagat 
gggatcgcat 
tgccctatct 
agagacaagg 
ttcagtttta 
agaattctgc 
gacagatatt 
aaaggaaagg 
agttttgggg 
tcagtttagt 
aaatatcaga 



tgcggggaca 

caaagcatct 

ccgtccgcag 

caagcccgcc 

cgccaacgcg 

ggcgttcggc 

gctgatgcta 

ggtgccctac 

ggcattctac 

agggagggag 

gcgacccgag 

gcagcccgcg 

tccacccact 

acacgttgga 

taaacacggg 

gtgtttggag 

tgtgctcccc 

atgacttt'ct 

ggggctttag 

ccgcgctgga 

gcgagtttaa 

agtgacaagc 

tctgcgcctc 

tgtgacttca 

tt'ctttctcc 

gttttttgtt 

tcctaggtgg 

atggctgggt 

tatatctttc 

tatttatagc 

agattctggg 

cagccacgca 

ggttggcata 

gatttcttct 

aattccacac 

ccataacctt 

tattgcaaat 

tatttactgc 

cttattcaaa 

tacactttct 

aacttaaaaa 

gaatgatttt 

ctgttattat 

atttagtatt 

attgattggg 

ctgaaagcag 

gagtctagaa 

gtgcatggta 



cggtctgcac 

gggatggccc 

ctcaagatcc 

gtgtacaact 

caggtctacg 

tccaacggcc 

ctgcacccgc 

tacctggaga 

aggtacccgc 

ggagggaggg 

ggtgcgcgca 

ctgcgttcag 

taccgtccgt 

aaaccagaca 

gcgctttgag 

ttagcagagc 

aaatttcctt 

gggaagcgat ■ 

agagcgattg 

gcggattgct 

aacaagccat 

ctgtcccacc 

gcagccactg 

atggcgaagg 

ctctcctctc 

ttcctgatgg 

ccacaggaca 

.cctgaattag 

tgccttgtaa 

agaaggaagg 

ggtattcagt 

gatccaagct 

tttcaaatta 

gcccaccttt 

.aacatttaaa 

atgatggact 

taatgctgct 

cttgctaata 

attgttcatc 

gcattacatg 

ttaaaaaaaa 

tcctgtgttt 

ccagcaagga 

t.ttcccctct 

ctcttttagg 

tgttttcaaa 

tattttttaa 

tttcgtaagg 



cctgcccgcg 
tactgcatca 
ccctggagcg 
accccgaggg 
gtcagaccgg 

tggggggttt 

cgccgcagct 

acgagcccag 

gcccgcgccg 

agaagggaga 

gggagcccgg 

agtcaagttc 

gcgagaggca 

aagcagcagt 

tcacttggga 

tcagcagagt 

tcatctagac 

tcctgtcacc 

ggagttgaat 

ggcatgtgac 

atggaagcac 

cggggagaat 

tggaaggagc 

ttgtgtgtcc 

cctcccattt 

gcctgcagag 

ggtgtacccc 

gcatttccca 

gccgtggacc 

ggactaaagt 

gtgcatcttc 

gtctttactc 

attttacttg 

ttttttttaa 

actgcttgat 

ccagaaacca 

gaatttcaat 

attacaaagt 

atcccagtga 

atattcaaac 

aacttgcaca 

tatttcccgg 

aaagaagcat 

cagtttttct 

aaactttaag 

ctgcttgtca 

aaggactaaa 

ataagttttg 



gccacggacc 

gatccaaggg 

gcccctgggc 

cgccgcctac 

cctcccctac 

ccccccactc 

gtcgcctttc 

cggctacacg 

cccgtcgggg 

gcctagggag 

ggcgcgcggc 

tctcgccg'gg 

gacccgaaag 

tatttgtggg 

aggtctcgct 

tttatttatc 

tatttgattg 

cgctttcccc 

gggtctgatt 

ttctgacagc 

aagtgcttaa. 

gccccggagt 

gcggccggtc 

tcattttaat 

tctcttgccc 

ggattaggtg 

ggactgggtt 

actgtacact 

•agtttttgtt 

gcagtttggc 

agccgccttg 

aaaagacaaa 

ttttaattta 

atggaagaac 

gtgaaggaag 

ttttcttccc 

gggcactaat 

gaacctggtc 

taagtggtat 

actcttagaa 

aatgggactc 

attatctttc 

ctatgcaagt 

gcttaaatga 

tttcttaagt 

tgacccagag 

atggaaagga 

t 



atgaccatga 

aacgagctgg 

gaggtgtacc 

gagttcaacg 

ggccccgggt 

aacagcgtgt 

ctgcagcccc 

gtgcgcgagg 

tggccgccgc 

ctgcgggagc 

ccagcccggg 

cagctgaaaa 

cccgggcttc 

ggaaaacacc * 

cttggcattt 

cttttaatgt 

gaaatatgtc 

tcctccccac 

tcggagttag 

cggaaatttg 

aaataatctc 

ggcgtgcggg 

caggacacag 

ttttttccct 

agtttctcct 

ggcgcttctg 

tggaagcttc 

ggtatccgga 

cagtattctg 

cccagaggat 

gagaaattta 

caatgaacaa 

gggttaaaac 

aaagtacagc 

gcactggtat 

tatttaattt 

gagactgctc 

aaatacagag 

cagtgtaata 

taataaaaaa 

tgtgtggaaa 

ctcttttgtt 

tcttcatatg 

ctctgggtat 

agttctcaaa 

ggtcatgaac 

atataataga 



4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7001 



<210> 5 

<211> 11001 

<212> DNA 

<213> Homo Sapiens 



<400> 5' 

ccaagtcaga 
actccacagg 
ctcctcatgt 
caaagcacca 
gtctggaagc 
ttgtcttcca 
ctagtgacaa 



tgttccccaa 
cctaatacct 
cattagatgg 
aggcaattct 
ctttgcttta 
gcacaattcc 
ccaagaaact 



ttacctgtgg 

tggagccaga 

actcctgcat 

tcctggcttc 

gttacaatgc. 

tcctccaagc 

.aagaactatg 



acaggtcagg 

aagcttccag. 

ctccagaaga 

ttgggacaac 

ctatacactc 

cttactgtag 

tactcacgtt 



catattctga 
gtaaaaagtc 
ttttccacac 
cctaggcttt 
ctggaactgt 
ctacagccca 
cacctcccca 



gtctaatttc 
tgaagggggc 
caggaaagat 
ggcatgagtg 
tttgcagggc 
tcagtcctgt 
gagtcatttt 



60 
120 
180 
240 
300 
360 
420 



ccttcaggac 
gctgctcatc 
cagggccagc 
cagggtgggg 
ttgcgggggt 
cttgtgcgtg- 
cggcgcctga 
ttcttgctgg 
gtgtaagaac 
ggtgagatgt 
cccaagataa 
tccccacgcc 
ctgggcattg 
ccactccaca 
cagggcttct 
cctgtccctg 
ttttgcatgc 
gcctagggcc 
ggcacatggc 
taaaatcagg 
taggaggatc 
atctctacta 
aactgttcag 
agctattatc 
acaaaaaaca 
ggccgaggcg 
aaaccccgtc 
ccagctactt 
tgaactgaga 
aaaaaaaaaa 
tgagttggag 
cggggccatg 
ccaccagtgg 
aatcactgtg 
ggctccaatc 
ctgtctggag 
cccaggctgt. 
agcgattctc 
acccggctaa 
cttgaactcc 
ggcgtgaacc 
tctagggact . 
accgactttc 
gtcagatagg 
aatactttgt 
.caggagagcc 
tgatggggac 
tccttaccca 
gccagggcct 
ccccaagtat 
aaccttactg 
tggaaaatga 
cgggaagaag 
agtgccaggg 
gctaacttgt 
cttcccttcc 
ccttcccttc 
cactgcacag 
tgcctgctgg 
tttcccttct 
gttttttttt 
ttggctcatt 
tagctgggat 



aaagctcagg 
atccaacagc 
tgctcagccc 
tgggaagccc 
catccaccac 
acagcacatg 
cacttgtgcc 
agggcacaga 
cgtccttgtt 
cagtctactt 
cctcagttgt 
cccttcctga 
tactccttga 
ggctcgtcct 
ctgaaaaatg 
atgtacatat 
tctgggtgga 
tcagcactga 
caggggccag 
ctgggtgtgg 
acttgatcct 
aaaaaagaaa 
gaggctgagg 
gtgtcactgc 
aaaacaggct 
ggtggatcac 
tctactaaaa 
aggaggctga 
tcgtgccatc 
aatcagtaaa 
tgagggtttg 
acccactgca 
acagagtagt 
acatgatgtt 
tgccagaggt 
atttggccca 
agtgcagtgg 
ctgcctcaga 
tttttgtatt 
tgacctcaag 
atcgtacceg 
ggattcttgc 
ttctcccaga 
ccctagttag 
cccagaagaa 
agggttctcc 
aaatggagag 
gaacccgctg 
tacaggaaaa 
ccaaaggtct 
gggacagtgg 
gggctggagg 
tacccagatg 
caagaaggaa 
tcttgtgatg 
tggatgaccc 
tggtgatgaa 
cccttccagg 
aagaggattt 
gcaccatagg 
tgagacagag 
gcaatctctg 
tacaggtgcc 



gccttgtcac 
ttccgcaagt 
ctgcatgctc 
accttggccg 
caagaacttt 
caggtgcttg 
ccgtcctggg 
gacagggcgc 
ctaaagaaat 
ggggctaggc 
gaccctctga 
gcagtcaact 
tcttctgctc 
gcaagatggg 
gcctctgggg 
acatagctgg 
taagggaaag 
atgtagcctt 
gaacagctga 
tggctcacat 
caggagtttg 
aaaaaaaatt- 
tgggaggatc 
actccagcct 
gggcacgttg 
ctgaggtcag 
atacaaaatt 
ggcaggagaa 
gcactccagc 
atcacacc.tc 
tcctggagaa 
gggtgagagg 
acttggagcc 
taaccttgac 
acacaggcag 
aagatttttt 
ctggatcttg 
ctcccgagta 
tttagtagag 
tgatccgcct 
acccagagat 
cggaagggtg 
ctcccttggc 
aatgcgagtg 
ttcccacaag 
ctgactccgc 
gacagatgag 
ctattctgca 
aggtaaaggg 
cctggatgga 
gcaaccttgt 
ctgtgaataa 
ggaggagtac 
gatcattttg 
gctctgagct* 
aggagccctg 
cctccctatc 
attctaaagg 
caacagccat 
gacccaccct 
tctcgctctg 
cctcctgggt' 
tgccaccaca 



tgggccctgc 
acactgtgaa 
aggtgcatgc 
tgacgctcag 



cgcagcaggg 
acagcatcct 
ccccgccggg 
accagcttca 
agagaaacca 
tgggtatgca 
gtatcaggca 
caccaagctc 
aatctcagct 
ccagcatgga 
cagtctttgg 
tgcccaccct 
acagaatcat 
aaggatacca 
gggactctga 
ctgcaatccc 
agaccagctt 
agccaggtgt 
gtttgagccc 
ggggaactga 
gctcacgcct 
gagtttgaga 
agccaggcgt 
ttgctcaaac 
ctgggcaaca 
aattgcacat 
ggcagcccat 
agtggagagt 
agttctccat 
ccaagctgca 
cttcctggtg 
ttttttttga 
gctcactgca 
gctgggacta 
atgggatttc 
gccttcacct 
ttttaactcg 



caggagccgc 
ggggaagtaa 
gtatataccc 
cgcgctcaaa 
-cctcaatgac 
tgggcaggta 
catcctgcaa 
gctgaacctt 
aaccttgata 
caaattactg 
catagctggg 
atgaagaggt 
tgagaaaggt 
cacagggccc 
aaactgactg 
gaacccacca 
ttggcttctc 
cagaagcagg 
agagggactc 
agcattttgg 
gggcaacata 
ggtggtgtgc 
gggagattgc 
gtgagaccct 
gtaattctag 
ccagcctgac 
ggtggcacat 
tcgggaggtg 
agagcgaaac 
■ tctgatcaca 
ttttctcctc 
ggtgcacatc 
gtctcacaca 
taaaaggcag 
ggtttctgca 
. gacgaagcct 
agttctgcct 
caggcgcgtg 
accacattgg 
cccaaagtgc 
accactcact 



gagtgtggga 
cccactgcat 
tcaicagacac 
gtttcctgta 
cctggagtcc 
ggagcagggt 
gccagcaagg 
ggaggggtgg 
gaagagcact 
cgggttagta 
agcagtggat 
taggggcagg 
tttgcagaac 
cctacctaac 
•ggctgggatg 
ttcactcaga 
aggaagagtt 
cccagtcgga 
ccactctacc 
ttttccaggc 
tcaagcaatt 
actggctaat 



cagggcagcc 
cagccttact 
agctaagctc 
gaatgatctt 
ccttaagcac 
ggagcgtttt 
atgtggggct 
gaatttaagc 
ggagtaaaaa 
aaa'acaaatg 
gtgtttgttc 
agaaatgcca 
agggagggca 
aatgagaaag 
cagtgacctc 
aaacagactt 
tctttttctg 
tgcacagcag 
actgtccata 
tggagtgcag 
ctctgcttca 
tttttgtatt 



agccgcaggg 
tgatcagaga 
tcacataggg 
gagtgcaaac 
ttcacgtgcc 
aaaggaagtg 
ggagggtggc 
gatgaagcct 
ataggttcca 
cttgcgccca 
tacctgctcc 
tgctgttgat 
caggtgtctc 
ttgaggaacc 
cctttggccc 
ctgctcctgg 
tctgctgcct 
ggcaactgaa 
tcatttaaag 
gaggctaagg 
gcaagacctc 
ctgtagtccc 
agctacagta 
gcttcaaaac 
cactttggga 
caacatggag 
gcctgtaatc 
gaggttgcag. 
tcggtctcaa 
gcaccctagt 
tgccccggca 
agtagtccag 
tagtgagaaa 
ctttaggcca 
cctgcctgtg 
cactctgtcg 
cctgggttca 
ccacaacaac 
ccaggttggt 
"tgggattaca 
ccccacctca 
agggctctga 
tcctgttgac 
agcgctgacc 
gtgcctagcc 
ttaaaccatc 
agcagaatgc 
aagaactaag . 
tcattttctt 
ccccagtaca 
gtgtgggccc 
agtacaccaa 
gacagactct 
cagggatggt 
cttgctcctt- 
atttccagcc 
ggattagagg . 
tttccaaagc 
gaccatggaa 
aaaactgatg 
tggtgcgatc 
gcctcccaag 
tttagtcgag 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820- 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 • 
3480' 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
. 4200 



: - 



acggggtttc 
ctcggctttc 
tgtttttttc. 
tttttttttt 
cccagtctcg 
ctccccagta 
tcagtacaga 
gatctgcctg- 
ccttctttct 
acgatcttgg 
ctgggactac 
ggtttcacca 
tggcctccca 
ttttgagaca 
actgcaacct 
ggattacagg 
ttttcccacc 
ggggtctaca 
agcaccaccc 
cactttcttt 
agtccctgtt 
■ atggcatctg 
acagtcctgg 
gctttccccc 
tttcctatct 
ccacattcca 
catacaggag 
tacttctcaa 
gagggagcag 
tttagtcata 
ccgtgttcaa 
tggaatcggg 
ccacgtagcc 
ggcggagctc 
tcccggcgcg 
tccgagtcct 
gaaggcgcct 
agccgccaac 
cccgggccag 
cttgtctcat 
acctcatcgc 
gccagcgtcc 
cgggaagtgc 
ggagtacttg. 
ctctgtgccg 
ctcgagaatg 
gcgctcacca 
caacagtcca 
gagaggcctg 
ggaagcccca 
ggttacctca 
cggaaggcca 
cttccccttt 
tcacccaggc 
tcaagcgatt 
■gcccagctaa 
ctcaaactcc 
ggcttgagcc 
tacttgaagg 
ctactcccct 
.ccctctaccc 
ctcagcgatc 
cagcatcatt 



accattttgg 
caaagtgctg 
ttttttttta 
ttttttttga 
actcactgca 
gctgggatta 
cggggttttg 
ccttggcctc 
tttttttgag 
ctcactgcaa 
agtcacacac 
tgttggccag 
aagcattggc 
gagtctcact 
ctgcctccca 
tgtgcacctc 
tttccacagt 
ctcaactcaa 
attttgtaag 
acttaaataa 
tctttttcct 
gcacacagtt • 
gttagtaacc 
ttcctgtggg 
attcccaaat 
gcagctggtg 
tgcagatgga 
ttcctggcga 
gagcattcga 
gctggataat 
cctgtctgtg 
gtcttacgca 
aggagggtgg 
cagcgaccgc 
gcctgcgagc 
cttggctgca 
ccgcctcgcc 
caccgccccg 
agccgcgccg 
tggggcagga 
tccggagctc 
gggcaagcgc 
gcgtgcgcgg 
cggagccggc 
ccgggaaatc 
cctgcactgt 
cgttcgtgtc 
ggcagacgag 
gcggtgaggc 
gggagagggc 
cactgctacg. 
caggcagaga 
cctcattggc 
tggagtgcaa 
cttctgcctc 
tttttgtatt 
tgatctcagg 
accgcacccg 
aaaaactaaa 
cagaccaaga 
actccagcca 
tgtgtactac 
tgaagcttgc 



ccaggctggt 
ggattaaagg 
acatataact 
gacagagtct 
acctctgtct 
caggcacaca 
ctatgttggc 
ccaaaatgct 
acagagcctc 
cctttgcctc 
caccatgccc 
gctgacctcg 
attagaggtg 
ctgtcactca 
ggttcaagtg 
cgtggctagc 
ggaaaatggc 
cccttgcacc 
gaattttgta 
ggatttccgc 
•ggtgcctagg 
■ctccgataat 
tggatctctt 
taaagtgctc 
ctttctccag 
acaggccaga 
ggagcaggat 
ggactcttcc 
acgcggaaat 
gcccggctca 
acagaaacca 
cggttccgcg 
ggctgcccac 
ttcccctccc 
tagcgaggtt 
gtagccactg 
catagccgta 
gtcgcgtgcg 
caaccgttaa 
acgccggggc 
cactcacaga 
acaagagtgg 
agcctgggtc 
aatccaggct 
ggcaattaga 
ggcctgccca 
ccgctccacc 
cgcgcggcag 
ggaggagctc 
cgctgctcgc 
cggactctaa 
ggcctgctca 
aattaaaaaa 
tggcgcgatt 
agcctcctga 
tttagtagag 
tgatccaccc 
gcccttcact 
cacctttaac 
cactggtctc 
ggcttgcctc 
caaatggaat 
tccctttgac 



cttgaactcc 
tgtgagccac 
tgggacttct 
tgctctctca 
tetgggttca 
ccaccatggc 
ctggcaggtc 
gagattacag 
accctgtcac 
ccggttgaag 
ggctaatttt 
aattcctgat 
tgagccaccg 
ggctggagtg 
atccttctgc 
cctccttttc 
tcaggactga 
caagctctgg 
aaattttatc ' 
cccatttctg 
cctggttctc 
ggggatacct 
tttccacctc 
ctgaatgctt 
atccacccta 
acagggaaga 
ctctctctgc 
ctctccacat 
cgaggtgcta 
ggtctaccac 
agggggcccc 
ggcaggtccc 
cgacccagga 
gcccgccggc 
cgcgcggtga 
ctcgtcgtgc 
cccgcccgtc 
tgcgtgtacg 
gactgaaacg 
ggggacacgc 
ccccacctac 
cctctggccg 
agcctgggcc 
cccctcccag 
acgctccttg 
tcctcgccct 
gcgggttccc 
cggtagtggc 
caggtcgggg 
caggctccgc 
tgttggccac 
acagttggat 
acaacaacaa 
tcggctcacc 
gtacctagga 
acggggtttc 
gcctcggcct 
gggaacgtat 
ctacgtctgc 
tatacactct 
ttcctccagg 
ccagtgttcc 
tcccaggggc 



tgacctcatg 
tgcacctggc 
cagcctccta 
tccaggctgg 
agtgatactc 
cagataattt 
tcgaactctt 
gcatgagtca 
ccaggttgga 
tgattcagtc 
tgtatgttta 
tgcaaatgat 
tacttggctt 
cagtggcacg 
ctcaccctcc 
aattggttag 
ctgacatgaa 
gctaagattt 
"tgaagcatca 
ccaggcatac 
tgagcctggt 
aggaggttcc 
tttaggcatt 
atggtccaaa' 
gaggaaggga 
ggtgagggct 
ctctcaagtt 
cctcccctag 
gtccaaactg 
aagccataca 
ggcacccagc' 
cggccaggac 
cgcggcaacg 
accccctggc 
agtactgctc 
• tgctccaggt 
ccccagtcct 
cgtgtcagtg 
tagatcgccg 
acgcttcgcc 
cacagggaac 
gaggcgaggg 
cgggtccgct 
cccccgcgca 
cgcgcggcac 
tcccatacgc 
agcccaggtc 
aggtgaactt 
aaatgtcccg 
aggcccgacc 
ctgggcgtct ■ 
ctctatcgcc ' 
aaaactgcgt 
gcaacctccg 
ttacaggcgc 
accatgttag 
tccaaagtgc 
atggaataca 
cctgtggttg 
aatccttcgc 
aaactgcccg 
attctccatt 



atccacccac 
ctaaaactga 
ttctttcttt 
aatgcagtgg ; 
ctgcctcagc 
ttttgtattt 
ggcctcaagt 
ccaagcccag 
gtgcagtggc 
tcccaagtag 
gtagagatag 
ccacctgcct 
ccttttctat 
atcttggctc 
caagtagctg 
tgtcttgtgg 
gacaagccca 
tggcgtgctg 
ctcacaactc 
tgagcttcac 
ggtcacacca 
gagacacctt 
ttataatcta 
acaagacttc • 
acagaatctt 
cagctggctc 
ttcctaaaca 
tctccccaag 
ctcggtcggc 
gctgcttttt 
atctaggcgg 
ccgcggggag 
gaccggggag 
tcccacctgg 
gagctccgag 
catttcgaaa 
gcgcgtccgt 
tgcgcgtgcg 
ggatctagct 
cccaggaatg 

.gggggcgggt 

cgggaaggtg 
tgcagcgggt 
gaattagcct 
ccaggcagcc 
cctcggcccc 
ccggggcccg 
gcaatctgca 
gagattgaag 
tatctcagtg 
ggaaaccggc 
tagcacagaa 
cttgcttttg 
cctcctgggt 
ccgccaccat 
ccaggctggt 
tgggactaca 
tctgcccatt 
tcacctgtct 
cttcactctc' 
ggacagggtc 
ctcaccccct 



tacactctcc cagttttcct 



4260 
4320 
4380 

■4440 
4500 
4560 
4620 
4680. 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 

.5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 

.6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660. 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380' 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 



• • • • 



cctaccccct 
acgatgaagt 
agatccacgg 
cactaggatg 
cagcctcact 
cagttactgg 
taaacatgcc 
cgctgcaatg 
actctgtcgc 
ctgtgttcaa 
cactacgcct 
caggctggtc 
gggattacag 
actttgtgct 
atgcacacac 
atgctcaacc 
tagggtccac 
ggactgtaaa 
gcttg'ctggc 
ttagcatctg 
cctgcttgca 
taatttccct 
■ ctgggaagat 
ttttaaagat 
atctgtggcc 
agcgatgggg 
ccgcgacttg 
gcgcggggcc 
tcgccacaga 
cggccaggga 
ttggcacgct 
" agctcaatga 
gggcgctagc 
cccaggatcc 
gtgcagggac 
cttgcccttc 
cggcgaccgc 
tagaggaaga 
acacttggcc 
gagggagatc 
ccggcagagg 
gcatgggtgg . 
caggacacaa 
tgggtattga 
ttctaggata 
aagcctctgc 
• ccagcctgta 
ggtgggggca 
ccacaggcct 
ctggagctgg 
ccagatttct 



gcagctcctg 
ctgggctcag 
ctgaaacact 
tctaaaagac 
gctacacttg 
agacaaaagc 
caaatcatcc 
tcccagcctt 
caaggctgga 
gcaattctcc 
ggctcatttt 
ttgaactcct 
gcatgagcca 
ccccccagct 
agacctggga 
cagactccat 
tcccagcagt 
agcacccctg 
gattcccagg 
attcactggg 
gggggcccac 
tcaccctaaa 
ggactccgag 
aagaaactta 
cagatacgag 
cgaaagtaac 
acccgcggcg 
actactcacg 
ggtcgcacca 
ccagctgccg 
ccagccgggt 
gctcaggctc 
aagcgcgggc 
agactgggcg 
tgtggggccc 
cttccctcct 
aaagcaccac 
gggtccccac 
tacccactgg' 
ggtcagcagc 
ctggaccgtg 
tcggccctca 
gtctttccat 
ggatggcaat 
ccctcacctg 
actgcagtaa 
ggtctcagcc 
ccccacccac 
taccttcgcg 
tgcttggcaa 
gggatctagg 



ctcagctcct 
tcctgatcac 
gacctaaacc 
gtttcaagtg 
cccag.cttcc 
ccaaacttat 
tgcttcttat 
gttctttttt 
gggcagtggc 
cacctcagcc 
tttttatttt 
gacctcaggt 
ccgcgctcgg 
gaatttgtga 
cactatccat 
gttacaataa 
tgattcaaca 
gtgattccag 
ggtgcagaag 
tcagaaaagg 
caaaggttct 
ggttctggag 
gacagtaggc 
aaatcagcgt 
tggagtgcga 
ggacctagtc 
actgcgctgc 
cgcgcactgc 
cgtgtgcgtg 
t'gtggggttg 
gcggcccttc 
ccccgacatg 
cgggcgggg'c' 
gcgggaagga 
gaaggcgggg 
tcgtcccctc 
gcggagatac 
atccggccct 
gtggggcagg 
tttaccgccc 
atcttgaggt 
gcaactgcag 
gcttttccca 
gcatgtgggg 
cagcaatacc 
gcacagcggg 
cacctggggg 
atccatgcct 
cctgcagccg 
ttgcctgcca 



ttgcagattc 
tggcctggtc 
ctcagactag 
aacatggcca 
tctttgcagc 
ctttgatttc 
ctccatggct 
tttttttttt 
acgatctcgg 
tcccgagtag 
.ttagtagaca 
gatccacccg 
ccccttgttc 
tgtcctcttg 
ccacaagttc 
tttggggagt 
ggtctgcggt 
ctggtgtcta 
gactgctggg 
gtgtttgcta 
aaatttttcc 
ggggtcatga 
acaaaccctt' 
atttttacat 
caagggataa 
ctcgggagct 
cccttggctg 
aggcctttgc 
gcgggccccg 
cacgcggtgc 
ccagcgcgcc 
gcccggttgg 
cacagggcgg 
gctgaggaga 
ctgggcgcgc 
ctcacacccc 
ccgtgtttct 
ggccctcctg 
aagtctcgag 
gctctgctct 
tcaggggtgc 
ccctcatttg 
gtgcttgact 
gatgtgggag 
actcattctg 
gccgctctgg 
aaagtcagga 
cctgcatccc 
caggtcctgc 
cttgcctctg 



tgactcaact 
tgtctacatt 
atcctccgtg 
aaatttaatt 
aaaaatggcc 
tcccttgtct 
actttatttc 
tttttttgag 
ctcactgcaa 
ctgggattac 
tgag'gtttca 
cctccgcctt 
attctttgca 
taccggatga 
ctaaataggc 
ttttaaaatt 
gggatccagg 
cccaggggag 
tgtgtggctg 
aataaagact 
aggctccctc 
gtgtttgaga 
tctcaagaag 
ataagcagcc 
accattttcg 
gtccccgccg 
ccccttccgc 
gcacgacgcc 
cgggctggaa 
cccgcgcgat 
cagcgggtgc 
gcccgtgctt 
gccccgactt 
gccgcgcaat 
tctcgcagag 
accccggacg 
ggaggccagc 
gtccggtttg 
ccttcacttg 
ccactgcgga 
.attctgggtg 
gctctgtcac 
tggcactccc 
tagggcttag 
gcatcgtgag 
agccactgcc 
aggtctgact 
ctccaccctc 
tctgaggggc 
ttttctcgct 



tccatatctc 
catctgcccc 
ccagtacctt 
cccttttctt 
actaggctcc 
ctacctctga 
tctttgagaa 
acagagtctc 
cctccgcctc 
aggcacccgc 
ccatgttggc 
ccgaagtgct 
ttctgtcaca 
gagggtctcc 
cagagcagtg 
tactgatgcc 
ctagcgggga 
agcaaccttt 
cgtgcatatt.. 
caacaaaact 
ccataggtgg 
agaggcaagc 
ggccaaggca 
acctctgctc 
cgcactcttc 
accccctctg 
tctcgtaggc 
ccagatgaag • 
gcggtggcca 
gcgcagcgcg 
cagctcccgc 
cgctggcttt 
cagcgcctcc 
ggaaacctgg 
c.cccccccgc 
gccacaacga 
tttactgtgc 
ctgaagcaac 
gggtgaggag 
gactggggct 
gattcccttg 
cctgggctgc 
tgcaggcagg 
aggtccaagg 
cagcgcttag 
tctagcacat 
ggccctggaa 
cctgccattt 
tgaacacatg 
ccagccgctc 



8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
858.0 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
•9660 
9720 
9780 
9840 
.9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
•10680 
10740 
10800 
10860 
10920 
.10980 
11001 



<210> 6 
<211> 3501 
<212> DNA 

<213> Artificial Sequence 



<220> . 
<223> chemically treated genomic DNA (Homo sapiens) 

<400> 6 

gcggggttgg taggggcgtf gttttggtat • agttcggggt ttggtagcgg cgggtggggt 



60 



atcggttaag 
acgggtcgcg 
aggggagttt 
ttgcgagtcg 
tttttttaag 
gaggcgtcgg 
gttcgggtta 
ttttttggga 
tttcgatata 
tcgttgtttt 
cggttagggc 
tggggtaatt 
ggtaaagttt 
attgttttta 
gttaggtgtt 
ttttgttttt 
gtttttcgtc 
gtggagggga 
ggtcggtgtg 
cgttcggtta 
tgtgcgacgt 
cggttagtag 
atgatatggt 
tggttgacgg 
tgggcgtcgt 
agaaacgttt 
gcggttacgc 
aggtttgtta 
gtttagaggt 
tcgtcgtatt 
agaagagttc 
• .cgcgttcgga 
tgtcgtcgtt 
tatttcgcgg 
gtttttttta" 
gtcgggagcg 
cgttcggggg 
ggttcggcgc 
gcgaggattt 
atggcgagtt 
tttttag.ttt 
tgtacggtag 
ttggaggcgg 
. tttatcgttg 
agaagacgta 
agcgtgggat 
acgcgtgcgg 
cgggcgagaa 
ttattagtta 
cgggtttggg 
tcgagggcgt 
ataaggcggt 
tggcgttgga 
ataaggcggt 
ttatcgatcg 
gcggttttgg 
ggataatcgt 
ttgtttcgtg 



agttgttatc 
gggtttcgcg 
agggttcggt 
agggtttggg 
tttggggcgg 
gttcgcgcgt 
gggtcgtagt 
gttgcgtggt 
acgttttttt 
cgtttttcgc 
ggcgttaggg 
tttttcgagg 
ttcgttttga 
aaagggttat 
ttggggcgtg 
ttttttggtt 
ggtgtttagg 
gaagtgtcgg 
tgttttcgta 
ttttaggtag 
gattatcgtg 
cgtttattt.t 
gagttcggtc 
cgtagaggcg 
gttggtcgtc 
taagcgttac 
gttttatggt 
ttcgttttta- 
cgcggttaat 
ttgtgtttcg 
gtcgggtttc 
tagttttttt 
gtcgtcgttg 
cggtagcggt 
tcgttggatg 
cggttttttt 
gttttcgttc 
gggcggcgac 
tagttcgttt 
cgagagtttc 
.tgagtagttg 
ggtcgaggcg 
cggggataag 
cgcgtcgtgc 
ttggttgatt 
tatgacgcgt 
tatgcggttt 
gttttacgag 
tatgaagatg 
ggggtttttc 
ttttgttgtg 
cgcggtcgag 
gagtttttat 
cgaggtgttg 
tttttttttt 



gtcgcgggga 
ggttaggagg 
ttcgggtttt 
gtcggcgtat 
cggagtttcg 
gtagggttta 
cggttgtgcg 
tttttttttt 
cgcgtcgggt 
•gcggtcgtcg 
cgggtatcgc 
cgggaggcgt 
atgatttttt 
tgcgtttgaa 
taggtcgttt 
cgggcgggtc 
tcgtagggtt 
gttggggtta 
ggagagtgtg 
ttgttgttgt 
gtgtagaacg 
aagtttttgg 
gtgtttcgtt 
gttgcggtcg 
gttagttatt 
ggtaagtatt 
cggtcgggtc . 
•gtcgggtttt 
acgtattgcg 
gagcgtcgtt 
gcggcgttag 
agcgtcggtt 
tttttttaga 
agttcgggat 
aagtacgagt 
agcgagcgtt 
ggtttggcgt 
ggcgacgatt 
ggcggttatt 
ggtgataatt 
aacgcgtacg 
gtcgaagtgg 
gtcgtcgggg 
gataagagtt 
cggttttatt 
tatatgcgta 
acgcgttagt 
tgttaggtgt 
tacgtcgtgg 
ggcgttttcg 
gttcgtttta 
ttgttggcgt 
tcgttggtta 
agttagggcg 
atttagagcg 



ggggagttcg 
ggaacggggt 
gtcgtcggat 
ttttttcgtt 
ggggagaagg 
ggtcgaggtc 
tcgtgttcgt 
tttttatttg 
tcggttttcg 
ttcgggtttc 
gttttttttt 
tggtttttcg 
ttgaagcgga 
tagttttttt 
tggttttttt 
ggtttgggtt 
gatgttttcg 
ggcggttagg 
ttgggtagac 
agtttaataa 
ttttttttcg 
tggtgtatga 
tggtgttgga 
cggtcgtggt 
tgtagatttt 
gttatttgcg 
ggggtttgcg 
cgtcgtcgtt 
tcgagttgta 
gttttttttt 
agcggtcgtt 
tcgtcgttta 
agttggagga 
tcgagttttt 
cgggtttggg 
gcgaagagcg 
cgtcgtcgcg 
ataagagtag 
tcgagggtta 
tgtacgtgtg 
tggaggttta 
tcgt'tggggt 
tttcgggtgg 
ataaggattc 
tatgtattat 
gttatttggg 
atcgttttac 
gcggcggtaa 

ggggcgcggt 

gtttcgacgg 
cggtcgagta 
agattacgta 
agtttacggt 
tttatttggc 
tttttcgtta 



ttcgtatttt 
agcgtcgtta 
ttttagttcg 



agggagcggc. 
tagtggcggt 

g 



gggggcggcg 
tttatttttc 



gttcgtcggg 
cgggcgggcg 
ttgggggtcg 
ttgtttgtag 
ggtcggggga 
gggacgcggg 
tcggggcgtt 
ttttttgttt 
cgttttgttc 
gatcgagggt 
ttcgtattat 
gttttttttt 
tattttattt 
ttcggaagtt 
tttatcggcg 
tttattttag 
tttagttgag 
gcgtcgtacg 
gatgttggat 
ttagcgtatt 
cgcgtataag 
taatttgttt 
ttttatttat 
ttcgggggtt 
cgatttcgtg 
gggcggcggc 
ggtcgttacg 
tgtcgcggag 
cgcgtcggga 
ttgtggtttg 
ggttgagcgc 
taaggagtcg 
ggtcgtatcg 
cggtcgtttc 
tagttatggc 
tggtggggac 
ttattttggt 
tagcgaggag 
tttatgttcg 
tatttcgtgc 
cgtggaggag 
cgtcggttta 
tttgggagag 
ggttacgttg 
ttgcgggaag 
ttttaagttt 
ggagtatatg 
gttcgtatag 
cggcgcggtc 
taagggtaag 
gttgagtttg 
ttttttgtac 
cgagttgggt 
ggtcgggttc 
gttcgtt'ttg 
cgtagggttt 
ggcggtttta 



atcgtaggta 
agtagcgggt 
cgaggaagag 
t'tggaaaatt 
gtcgcggagg 
tggggcgtag 
gttttttttt 
tagttttttg 
gttacggtag 
tgatagtttt 
tttttttaat 



tttttatttg 
aaatcgggta 
ttagtattta 
gtcgtttatt 
agggtagtcg 
ggaaggggaa. 
gtttttattc 
acgatggagg 
aagggttttt 
aacgtgttgg 
aatttggatt 
atcggtcgtt 
gagtcgagtt 
gcgttgtgta 
ggcggcggcg 
tcggttattt 
tcgttttcgg 
ttcggttcgg . 
gatttgttta 
gagttgtttt 
tttttcgttt 
tttttcgatt 
gacgggttta 
gacgagttgg' 
gcggtcgttt 
agtttggacg 
atcggtagta 
tatttggttt 
ggtaagggtt 
gaggaagcgt 
gggttttttt 
ttgttgcggt 
cggtagtacg 
aagtttacgt 
ttcgcgtgcg 
cgtatttatt 
taacgtaatt 
ggggcgttgg 
ttcgattttt 
aagtagtagg 
gattttaagg 
tttagtttcg 
gacggtcgga 
tcgttgttgc 
attgtgttcg 
tttggtttta* 



120 
180 
240 . 
300 
360 
420 
480 
540" 
600 
660 
720 
780 
840 
900 
960 
1020 • 
1080 
1140 
1200 
1260 
1320 
1380 
1440- 
1500 
1560 
1620 
•1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
25.20 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
'3180 
3240 
3300 
3360 
3420 
3480 
3501 



<210> 7 
<211> 3501 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 7 

tcgagttaag gtacgaagta gtgaggttag gtgaggtcgt cgagaggtgg agtcgttatt 60 
gtggcgacgt tgcggttgtt tcgggtatag tgggttttgc gcgtcgtttt cgtcgttttt 120 
• tggggtgcgg gttagggtcg cgtagtagcg atagagcggg ttggcgaggg gcgttttagg- 180 
tgggagagaa acggtcgatg' gttcggtcgt cgggttcggt cgttaggtga gcgttttggt - 24 0 
ttagtatttc ggtcgttttg tcggggttga ggtttagttc ggtcgtgaat ttggttagcg 300 
ggtagaggtt ttttagcgtt attttggggt cgtgtaggaa gtgcgtggtt tgcgttagta 360 
gttcggtcgc ggtcgttttg ttttgttgtt ttaggtttag ttgttcggtc gtgaggcgag. 420 
ttatagtaaa gacgttttcg gggaagtcga gtttgttttt gtcgtcgggg tcggggacgt- 480 
cggggagttt ttttaagttc gttagcgttt cggtcgcgtc ggtcgcgttt tttacggcgt 540 
gtatttttat .gtggttgatg aggttgcgtt gttgtgcgaa tttgtcgtcg tatatttggt 600 
attcgtaggg tttttcgttc gagtggatgc gtatgtgttt cgtgaggcgg tattggcgcg 660 
tgaatcgtat gtcgtacgcg tcgtacgcga agggtttgag gtttaggtgg ttgcgtatgt 720 
ggcgcgttat ggttttacgt tgcgtgaatt ttttttcgta gatggtgtat gggtagggtc 780 
gggttagtta gtgcgttttt tcgtgttgtc gtagcgtggt cgggtttttg tagtttttgt 840 
cgtacgacgc gtagcggta'g ggtcgtagta .gtttttttag' gttattcgga gtttcggcga 900 
ttttgttttc gtcgttttta aaagggggtt ttaggtcggc ggttttagcg gttatttcgg 960 
tcgtttcggt tttgtcgtat agcgtttttt ttttttttac gtgagttttt acgtgcgcgt 1020 
ttagttgttt agagttgggg aagtttttgt cgtacggaat gtatacgtat aggttgttat 1080 
cgaagttttc gggttcgtta taggttaggt gcgggtatgg gtagttttcg aggtggtcgt ' 1140 
taggcgggtt ggggttttcg ttgttatcgg ttttttcgtt gttgtttttg tagtcgtcgt 1200 
cgtcgtcgtt cgcgtcgggt tcgtttaggt tgttagggta gcgcggcggc ggcgttaggt 12 60 

cgagcggggg tttttcgggc gagacggtcg cgtttttatt acgtttttcg tagcgttcgt 1320 

tgggggagtc gcgttttcgg tttagttcgt cgttatagtt atttaggttc ggttcgtgtt ' 1380 

ttatttagcg atagaggaga ttaggttcgt cggggcggtc ggggggttcg ggtttcgggt 14 40 

tgtcgttgtc gtcgcgaaat gggtcggaag gcggtgcggt tttttttagt ttttggaagg '■ 1500 

.gtagcggcgg tagbgacggt agggcgagag gcggtttttt gtaggcggcg gggtcggcgt 15 60 

tgggagggtt gttcgggcgc gggggtagtt cgcgtttagt tagcggtcgt tttggcgtcg ' 1620- 

cggagttcgg cgggtttttt ttggataggt ttaggttata aagaggggag tagcggcgtt 1680 

tcgaggtata gagtgcggcg gtcgggtcgg gtttcgacgc gtatagttcg gcgtagtgcg 1740 

tgttgatcgc ggtttttggg ttcgagggcg gtttcgcggt aggcggcggc ggaggttcga 1800 

ttggggacgg gtagtaggtt tggatgatcg gcgtggcggt tcgtaggttt cggttcggtc 18 60 

gattataggg cgcgtagtcg tcgtcgtcgt cgtcgtcgtt tcgtaggtgg tagtatttgt 1920 

cgtggcgttt gaggcgtttt ttgtatagcg- ttacgaggtc ggggatttgt aggtagttgg 1980 

cggcggttag tacggcgttt aggttcggtt tagttttcgg ggttacggtc gcggtcgtag 2040 

tcgtttttgc gtcgttagtt aggcggtcgg tgtagatgaa gtttagtatt aggcggaata- . .2100 

cggtcgggtt tattatgtta tggtttaggt tgagtaggtt gttatgtatt attagggatt 2160 

tgaggtaggc gttgttggtc gttagtacgt ttttgtgcgc gcggaagagg gcgttttgta . 2220 

.ttacgatgat tacgtcgtat aagaagtttt tggtgcgttg gttgttgagt tgtagtagta 2280 

gttgtttgga gtggtcgggc -gtttttatcg tgtttagtat cgtttgttta gtatattttt 2340 

ttgcggggat atatatcggt cgggtgagag tcgtgcggcg ttttggtcgt ttggttttag 2400 

ttcggtattt ttttttttta tttttttttt ttttagttga gcgggggtat ' tagttttgcg 2460 

gtttgggtat cggcgaagga tcggttgttt tttggagtgg gagtttaggt cggttcgttc 2520 

ggattaggag aaggagtagg aggtgagcgg tcgtcggtgg aggggaggtt agggcggttt 2580 

gtacgtttta gggtatttgg "ttgggtgttg gggttttcga gaagaaaatt gtttaggcgt ' 2640 

agtgattttt ttggagatag ttattcgatt taagtaaaat gttcgtttta ggaaaagtta 2700 

tttagggcgg agaattttat ttaagtaggg agaaagggag tcgaggaatt agcgtttttc 2760 

gtttcgggag aagttgtttt agttggggga agtgatacgg aggaggggag cgcggtgttc - 2820 

gttttggcgt cgttttggtc gggggttgtt aattttcggt cggggttcgg gcggcggtcg' ' 2880 

cgcggggagc ggaggtagcg gttgtcgtgg cgggtagagc gcgaaggtcg ggttcggcgc * 2940 

ggggagggcg ttatatcggg gtaggaggtt gaggtaggaa gtaggtgggg gggagggggg 3000 

agttacgtag tttttagggg agggaggggg tagcgtttcg ggcgggtacg gcgtatagtc 3060 

ggttgcggtt ttgattcggg tttgcgtttt attcgcgttt cggtttcggt ttgggtttta . 3120 

tacgcgcggg. ttcggcgttt tttttcgcgg- ttttttcggt tttttttttt tcggae^tttc 3180 

gtcgttttaa atttggggaa aagtttttta attgtagata gggcgggagg agtgcgtcgg ' 3240 

ttttaggttt -tcggttcgta gttttttttc gcggttttta aattcggcgg tagagttcgg 3300 

agtcgagttt tgagtttttt tgttcgttgt tcgttcgttc gatttcgttt tttttttggt 3360 

tcgcggggtt tcgcggttcg ttatttgcgg tttcggcggg tcgggttttt tttttcgcgg 3420" 



in 



• 



cggtggtagt ttttagtcga tgttttattc gtcgttgtta ggtttcgagt tgtgttaggg 
tagcgttttt gttagtttcg t 

<210> 8 
<211> 2501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



3480 
3501 



<400> 8 

tttttatagt 
tagtataaaa 
tttttttttt 
gggtcgttat 
ggatagggtt 
ggttgtatcg 

.atttaggaga ' 

.agattattta 
aagtagtggg 

- gtagggatta 
gttatttttg 
aaatttagaa 
gtgtcgagtt 
aaatggtttt 
ggattttaag 
gattttcgtt 
gttttgtttg 
cgacgggcgg 
.tcgcgttttt 
gtagttgagg' 
attttttgag 
tttttagtat 
ttgttgattg 
tatttttttt 
cgcgaggggc 
ttattcggtc 
tttttttaaa 
gttataattt 
ataggggtta 

. ttggagagta 
gcggtaaaat 
aataaattat 
agtgggtgat 
ttattggtag 
taaacgttat 
taaagtggtt 
attttatatt 
ttttttaaat 
attttataac 
gaggtttagg 
ggggttatat 
tttatatatg 



gtaaatgtgt 

tttaaatatt 

taaatttagt 

tttaggtttt 

tgtttatttt 

ttattgaaag 

tgtttagtaa 

ggaatatttg 

ggtttaggat 

ttatattgat 

gaagtttttt 

aacgtttatt 

aattattttt 

tggaaggtaa 

ggagtcgcgc 

tcgttttcgt 

ttcgtgtgtg 

gtggtattgg 

tgttgaggcg 

ttatttcgtt 

gaggaaagag 

ttcggtgtgg 

taaaattggg 

gtaattaatt 

gggggttgga 

gtttagtgtt 

gagggcgttt 

gtt.ttcgaaa 

ggcgtatgat 

gagaatttcg 

ggaaaattgt 

ttatattaat 

ttttgtaatt 

tattttatat 

aaattggaat 

ttttatgttg 

tatagttgtt 

gtttttagga 

gtttggtgtt 

gattatattt 

ttgaaaatta 

gtatgtagta 



ttttattatt 

taaatatgat 

tagtgtttgt 

ggtttttttt 

gatattatta 

tttattattt 

atgttgattg 

ttaaaatgtt 

tttgtatttt 

aaatttttat 

aaggataggg 

gagtgaatga 

ttatataagt 

gttttttagc 

gttttgcgtt 

tatttttcgt 

tgtcgttgta 

ttatcggcgc 

tagttggttg 

gaggtggtgg 

tcggaaatat 

gttttttaag 

tgaaagtttt 

tttttagatt 

attcgttttt 

tgtttttttt 

gataagagtt 

tttattggag 

ggagaaagaa 

aagtggatcg 

tatgtgaaat 

atagatgttg 

tttaagtgta 

ttggaaaggt 

tttgttaatt 

ttttttagaa 

tggttatttt 

aaatattttg. 

ttaaattttt 

taagacgttt 

tttaaggata 

tattttatat 



ttttggagta 

tttgttggtt 

gttgtttgta 

tagaaagttt 

gtttatttat 

gattttgagt 

aattatgatt 

tttgtttttt 

gatagttttt 

tatttttaag 

gtcgtatgtt 

tgaaacgata 

tttagtagtt 

gagtggagtt 

tattttttta 

ttttttttgt 

ggttttattt 

gggggtaggt 

tttttttcgg 

ggaggggaat 

ttggtttttt 

gttttgtttg 

ttttgatttt 

tttgtttagt- 

tgaagcgtag 

tgtcgtaaga 

ttttttcgtt 

ttttggtaga 

aatgggagta 

gttagtggga 

agttttattt 

tgtagtgaga 

gggattttga 

tttaatgtat 

aatttttat'g 

aatgttttta 

tgtatgggta 

tttatatttt 

tttgatagtt 

ttgttatcgt 

gggattgttt 

tagtatttag 



attttattta 
ttgtttttgt 
atgttttttt 
tttttttttt 
atatattggt 
agtttgagga 
gtttttaata 
gagattttat 
tgatgtttgt 
ggggattttt 
gtttttttag 
ggtgaataga 
tttattgttt 
atttttaatt 
ttagaaatcg 
ttcgtagtcg 
atgggtttat 
gagtatgcga 
gtcggtatac 
ggttattttt 
aagtaggtat 
agaggagagg 
tatttgtggg 
tttttaaagg 
agatgttagt 
ttttaagttc 
ggagtttgta 
ggttgtaagt 
ggatgggtat 
aagttgtttg 
tttaaagtat 
ttttatatta 
tattatgtat 
gaattatttg 
tatttttata 
gtgttgatga 
tgtatttggg 
gtttttattg 
tttggataat 
taaggagatt. 
• tttttgatat 
t 



aaatcgtttt 

ggttttattt 

ttttttgtag 

tttttagcgg 

tataagttta 

ttttattaaa 

tataaacgta 

tttgggaggt 

atgtagaagt 

ttttttaggg 

gttagtaatt 

tgaacgtaag 

ttagtcgtag 

atatttttta 

gtaagttatt 

gcgtttagcg 

cgttgaggtt 

aggttggagg 

gcgcgtagtc 

gaggtatcgt 

agttcgtttt 

ttaggttggg 

tatcgattgt 

attgaaaagt 

ttttgaaaag 

gtgagaggat 

tgtttagtaa' 

ttaaatgcgt 

atttgaggaa 

tattttagga 

aaaaaattaa 

gttttttatt 

ttttgatttt 

agttatatat 

tttgtattga 

atagttaagt 

tgtagttata 

taaataatgt 

ttttatgtag 

ttttttttta 

.tattagtata 



60 

120 

180 

240 

300 . 
.360 

.420 

480- 

540 

600 

660 

720 

780 

840 

900 

960- 
1020 
1080 
1140 
1200 
1260 
1320 
1380 • 
1440 
1500 
1560 
1620 
1680. 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
.2280 
2340 
2400 

2460 

2501 



<210> 9 
<211> 2501 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> chemically treated genomic DNA (Homo' sapiens) 



1 » 



<400> 9 • 

attgagtatt ggtgtaaaat gtattgtata ttatgtgtaa gtatgttaat ggtgttaaaa 
gaagtagttt ttatttttga atgattttta gatatagttt ttgaaaagga "gttttttt 
Lcgatggta aaaacgtttt agaatgtaat ttttggattt tttgtatgaa aattatttaa 
glg?tg??aa aaaagattta aaatattaag cgttgtaaaa tjt.tt.ttt ataataaaag . 240 
?aaagtgtaa ataaaatgtt tttttaaaaa tatttagaag gtatgattat atttaaatat 300 
atg??tatgt agagataatt agatagttat gggtataaaa tatttggtta tttattaata 3 60 

ttgaaagtat tttttgaaag gtagtataag aagttatttt attaatatag atatgaaagt 
atataggaat taattgatag aattttagtt tgtaacgttt aatatataat ttmtutt 
tatgtattag ggttttttta ggtataaggt attattagtg gagaattaaa ggtgtataat 
gttaagattt ttgtatttgg aggttataga ggttatttat tggtgagaaa ttaatgtaaa 
attttattgt atagtattta tgttggtatg aatggtttgt tttaattttt tgtattttaa 
aaaatggggt tattttatat aataattttt tattttgtcg tttttgaaat faggtaatt 
mSIttgg tcggtttatt tcggaatttt ttgtttttta gttttttaga tgtgtttatt 
ttatttttat tttttttttt tattatacgt ttgatttttg tgcgtatttg agtttataat 
JStgttaag attttagtgg atttcgagaa tagattgtga tttgttaagt atataaattt 
Scggggaa gggtttttS tagacgtttt ttttaaagaa ^tttttta' cgaatttaaa 
Stt?acgat agagggaaat aaatattgaa cgatcgaatg attttttagg jgttgatatt 
tttocotttt agggggcgaa ttttagtttt cgtttttcgc ggttttttag ttttttaaaa 
gat?aggtaa'aga??Laga gagttaattg taggaagagt aataatcgat gtttatagat 
aaqqqttagg gagaattttt atttagtttt gtaattagta gtttagtttg gttttttttt 
ttaggSga ?t?tgggaag tttatatcgg ggtgttgggg agaagcgggt tgtatttgtt 
toaqaqatta ggtgttttcg gttttttttt ttttaagaga tgcggtgttt taagaataat 
SSJttttt tttattattt ?agcggggtg attttagttg cggttgcgcg cgtatgtcgg 
ttcgaaaaga gtagttagtt gcgttttagt aaggggcgcg gtttttaatt ttcgtatgtt 
tatttrtttt cgcgtcggtg attagtatta ttcgttcgtc gaattttagc ggtgagttta 
^gaataaggt ??g?aacga? .atatatacga ataagtagag tcgttggacg tcgattgcgg 
gltaggagga ggcggggaat ggcgggggcg ggacgagggt tagtgatttg Jcgatttttg 
qtaqgaagat gagcgtagag cgcgcggttt ttttggaatt ttgggaaatg tagttaagag 
SmttStt cg?tggaaga tttgtttttt aggggttatt tttgcggttg gaagtaatgg 
llnllnllla oatttotota gaagaatagt taattcgata ttttgcgttt atttatttat 
?tg"cgtS SESEtt LaLaacgt tttttgggtt tagttgttga tttagagaaa 
^qtScgg tttttatttt tgaggggttt ttagagatag tttttgggaa ggaaagtttt 
SttlgggS ggtaaagatt tgt?agtgta ataatttttg tatttttata tgtaaatatt 
agggga??gt ?aaaatgtag agttttggat ttttattgtt tattttttaa "tagaattt 
taaggggtaa aaatattttg ataagtgttt ttaaatgatt ttgcgtttgt atgttgagaa 
tag??atagt ttaattaata tttattgagt attttttgag ttttgatagg atttttaagt 
tatttagagt tagatggtaa atttttaata acggtgtagt ttagatttgt gattaatgtg 
tgtaag?gag tXatggtgt taaaataaat agattttatt ttcgttgggg agaaagagga 
lllllltlll gaaggaggat taggatttaa agtggcgatt tttgtaaaga -gaagggta 0 
ttataggtag tataaatatt agttaggttt ggggagaaag agggtaaagt tataaaagta 24UU 
aagttagtaa gattatgttt agatgtttga attttgtgtt gaaaacggtt ttaagtagga 
ttattttaga gagtggtggg aatatattta tattatggaa a 



<210> 10 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA .{Homo sapiens) 
<400> 10 . 

aaagatgatt aaaagtttaa . ttgtttattt gaagagttga tttttttatt tttgtaataa 
Taaalllttt tagtagtttt tgtttatttt gtttattcgg ttttttttgt ^ttgtgtaa 
gg??araatt tt?gtgtttt agtaaatttg tgtatgttta tttttttttt tgttattatt 
ttttttttta ttttgtttta ttattttgat gtaaaattat ttgttaattt tatttgaaat 
gagaaatttt aaggtttata ttatttaaat tttgttagat ttttattttt «ttatatggt 
ttataatgtq ttgggtattt ttagatttgt ttattaaaaa gatgtaaaat aaaataatga 
.tt£"Sg? ggStttttt tttatttttg agatgttttt tttggttgta ttattttttt 



60 
120 
180 



360 
420 
480 
540 
600 
660 
720 . 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



2501 



60 
120 
180 
2.4 0 
300 
360 
420 



attttttgtt 

aagaggttat 

ttttagtata 

ttgtaagtgt 

atgattaggt 

gtgatttttg 

attgtagatt 

tggtagagtt 

gtttgtttta 

gtttaagtat 

aggtaggatt 

tttgaatttt 

tttttttatt 

taattgttag 

tttttaggta 

agtataggtt 

tttatttatt 

tttattttgt 

tggaggatgt 

ttattaattt 

agtattatta 

aatgggtaaa 

tttgtgagaa 

tttattttga 

tgttttgaga 

gtttttttta. 

aaaatttttt 

ttagttgtta 

agaagataat 

aagattgtga 

tgaggtaaga 

.ttttgtaaaa 

atataaagaa 

taaagagaag 

tagaaattag 



tattgattag 
gttatatgta 
ttttttttta 
ttatttataa 
ttgaaattaa 
tttgtaggat 
ttagtcgttg- 
ttttaattgt 
ttttgtgtaa 
tgtttaagtg 
ttatttgttt 
atttggttaa 
tggttatttt 
attttgtttt 
ttttttggtt 
attatagtta 
atttttttat 
taaatgattg 
gtaagattag 
attttggttt 
atataaataa 
atatttttaa 
ggtaaatgta 
attaagattt 
ttaaaaatta 
tttttttttt 
ttaattttat 
aagtaagttg 
tttaattatt 
tttttaattg. 
tattttattt 
gttagtattt 
aataagtgta 
agattaaata 



aggaggggtt 
tatttttata 
ttaattttaa 
attgtgaaat 
tatataagat 
tttttttttt 
tagaattaat 
tgaattttaa 
tggattttat 
taaattattt 
taaaatgatt 
ttttaatata 
ttgtattttt 
attttttttt 
ttagtttgag 
attttttgaa 
taaaaaggtt 
ttttttaatg 
ttttttatta 
taaaattttg 
atttttatta 
aattttgaag 
tttagatata 
tattgtttta 
gtatattaat 
taaaaataag 
aggtttaagg 
aggtattgaa. 
ttataaaatt 
tagtttattt 
aaaagtgtat 
atatttttaa 
aaattttaat 
tagtattatt 



ttaattatgg gtgaatttta 
atataattta tatttatata 



taatattatt 

gaattaaaat 

ttaatttttt 

ttttagatgt 

tagatttaag 

tattgtcgtg 

attatagagg 

aatatttttt 

tgatttaaat 

atttttagta 

tttgtttttt 

gatagttttt 

tatagtagat 

taaatgtgat 

taaagttttt 

atattttaga 

aataaaaagt 

tgattattag 

taattatatt 

aaattattat 

attaaagttt 

tttttttaga 

ataatttttt 

tagttattaa 

gtagttaagg 

gatggagaat 

ggaaattgag 

atttttattt 

tttaaattaa 

ataagatata 

ggttagttag 

gtattgaggt 



gtaagttatg 

gaaagggtaa 

ttaattaaag 

tattggattg 

atgagttttt 

attagttagt 

tattttttta 

agttattaag 

taaaaagaga 

ttttataatt 

tttttttttt 

atttttaaigg 

tttaagatta 

tgaattttat 

atttaatgtt 

attttagaat 

gtgaaatgga. 

ataaaattta 

ttttaagttt 

ttgatagaaa 

ttttttttat 

tgttgttatt 

ttaa'acgttt 

attttttagt 

atggttgtag 

ttaaattttc 

gtatttaata 

agtatttttt 

gtaataatat 

ttgaatttat 

tttttagttt 

aaggtttttt 



tattttattg 

gtatttttat 

ttgaagtaga 

agattaaatt 

atttttgtag 

tattaggttt 

tgattttttt 

gttataattt 

atgttaagat 

taattaagat 

atgtggattt 

ttttttaaag 

ttagttataa 

ttatttattt 

tatatgttat' 

gttagtaatt 

tttttttatg 

tattttatta 

atatgtaatt 

gaaataaaat 

tgtttgtaag 

gtttaattta 

tttaatttta 

tgaataatat 

taagagtttt ' 

agtgaatttt 

ttttatatga 

gataagagtt 

tgaaggtatt 

tttgtaaatt 

gtaaattttt 

ttagtgaatt ■ 

tttttaagat 

gtgtagttta 



480 
. 540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2470 



<210> 11 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
'<400> 11 



ttagttttta 
ttttttttta 
ttttttatat 
ttttgtaaga 
ttttatttta 
ttatagtttt 
attatttttt 
taataattga 
aaaggatttt 
taaaggaaat 
ttttaaaata 
ttaagataaa 
tttttataga 
tttatttatt 
tagtaatatt 
ggattaatga 
atatttttta 



tgaattatat 

attttaaaaa 

gatttattga 

aagagtttat 

agtttataag 

gatattttta 

agtttttgtc 

ttatatgaag 

gaaatttatt- 

aaaattt.tta 

atattgttta- 

tgaaattaaa 

tggattaaat 

tttataggta 

attttatttt 

gattatatat 

tggtgaaata 



agaaaatttt 
gaattaagaa 
atgaatttaa 
atattattgt 
agggaatatt 
tattagatgt 
gaagatttga 
ttgtagttat 
attaaaaaat 
agacgtttaa 
agtagtagta 
atagaagaga 
tttttattaa 
aaatttagga 
tgaattttat 
tttattttat 
gttttgaagt 



gttttagtat 
ttaattagtt 
tatattttat 



agtgatgtta 
attggagttt 
ttaaaaatat 



ttaatttaaa atatattttt 



gaataaaaat. 
tttagttttt 
gttttttatt 
ttttggttat 
ttagtggttg 
gaagaattgt 
tttaagagga 
gaattttagt 
gtaataattt 
aatataatta 
ttgatagtta 
attttttatt 
tttaagatgt 



ggataaatta 
agttttgtaa 
tttagtgttt 
ttttggattt 
tttgttttta 
gttaatatgt 
tgaaatagtg 
tgtgtttgaa 
ttttaaggtt 
tgataaaaat 
tagaatttta 
tgataaaagg 
tattaaaagg 



tatttggttt 
tatatttatt 
aaatgttaat 
aagtaaagtg 
taattaaaag 
gatgattgga 
taatttgttt 
ataaggttaa 
aagaaagggg 
tagtttttgg 
gagttttagt 
tatatttgtt 
ttaaagatat 
ttatttatat 
gagttagaat 
ttaattttat 
taattattta 



60 
120 
180 
240 
300 
360 
420. 
480 
540 
600 
660 ' 
720 
780 
840 
900 
960 
1020 



•.•••« 
• # 



ataaaatgga 
ggtaaataag 
agtttatgtt 
tatttaaaga 
ttggtagtta 
ggtaagagga 
aagatttagg 
aattttgttt 
atatttaaat 
taaggtagat 
gattttatta 
aatttatagt 
tagggattat 
gtttggttat 
atatttataa 
tatgttaaaa 
ataatttttt 
ggtaaggggt 
ataggagtga 
tatattataa 
aaaatttttt 
taagagaaaa 
agttataatt 
aaagtatttt 
aattattttt 



tatgaaggag 

aattattaat 

atggtatatg 

gaatggatga 

ttatgattgg 

ttttaggaag 

agatttatat 

attttaatta. 

attttgatat 

aggttgtaat 

aaggaaatta 

aaatttggta 

ttataaaagt 

gatttaattt 

tttgttttaa 

ataaaagtat 

tagtaagata 

gaagaagtaa 

ttattatttt 

attatatgat 

attttaaata 

ggtagtaata. 

ttatataatt 

ttattatagg 



agtattaaat 
ataaagttta 
tggttttggg 
ttttaggagt 
gaaagaagag 
aattatagaa 
ttttttttta 
tttaatagtt 
taaaaaaatg 
. attggttagt 
ggggatttat 
taatttaatg 
ttttggttga 
ttgttttttt 
tataatttat 
tatgtaaatg 
tagggtttat 
tgtagttaaa 
gttttatatt 
agaggtaggg 
aaattaatag 
gagaaaaaaa 
ataaaaagag 
aataaaaa'aa 



gaagatttta 

attatattta 

atttgttgtg 

agagattgtt 

gagagataaa 

tgttaggagt 

gtttaggtta 

aaaaagtatt 

tttttgtaat 

tacgataata 

tttagattta 

atatttgagg 

aaaaaattga 

attttaattt 

agtgatatta 

taagttatat 

ttatagttaa 

ggagatattt 

tttttaataa. 

atttgataga 

gtaattttat 

tgggtatgta 

tcggatgggt' 

ttaatttttt 



agtttttttg 
tttaaaaggt 
tttaaattga 
aaagagaaat 
gaagatataa 
tatattaaga 
aattattttg 
aagtagtttg 
atgaaattta 
ttggagttta 
gttagtttta 
aggaagggga 
gttttgtgtg 
attttataat 
ttaggattaa 
tatgaaaata 
gatttttttt 
taaaaataaa 
gtaggtttga 
atttgaataa 
attaaaataa 
taagtttatt 
aagatgagta 
agatgaataa 



ataggaagat 
tgattataat 
ggttaaaaga 
gaagtagagt 
aagatagtta 
ttaattaagt 
gaataaataa 
tatttaagta 
ttatataaaa 
gtaattggaa 
taacggttag 
gttttgtagg 
ttaattttag 
ttgtaaatga 
taaaaaaagg 
tatatgtaat 
ttgattaatg 
ggaaaaattt' 
aaatatttag 
tgtgaatttt 
taaaataaaa 
gagatataga 
gagattgtta 
ttaaattttt 



1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460' 

2470 



<210> 12 
<211>.7001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 12 

aatgtaatgg 
gtgtggaagg 
ttggagagat 
gaatggtgat 
ttggagatat 
tagtaagtta 
gagttgggag 
gtatattttt 
aagagaatgt 
ttttgattgt 
tttttatgtt 
tttttttttt 
"tggttattgt 
tttaatattt 
gtgtttaaat 
ggtattatta 
tataaatttt 
gtgattagag 
gggtgttatg 
gttagt'atta 
gttattggga 
attagtttaa 
tacgtatttt 
gggtttggtt 
ttaaaatatt 
gtttagagat 
tttattgtta 



aaaaagagag 
taagggaaaa 
agtgatgttg 
ttatatttga 
atttaacgga 
cggttggaac 
ttcgagtgaa 
ttatttatgt 
tggagagaaa 
aagtttttga 
aatagtgaat 
tttttttttg 
tgatttatag 
atttatattt 
attgtaatat 
atatttttat 
ttagtaattg 
ttatttaata 
tgttttggga 
tggtcgagag 
aatgagagat 
a'ataaaggta 
ttatatgttt 
aacgaagtgg 
tattttaaag 
tgggggttag 
aattatattt 



attgtaaagt 
ttagttttcg 
ggtatagata 
ataagtttgt 
ggagttagat 
gtgtgtgttt 
gtttttgtta 
ttattaagtt 
gtggttaaga 
ataattaaag 
aattgtagat 
aatatttttg 
gaggtattat 
agtatttatt 

tgggggtttt 

ggttttttat 
tatagtgttt 
tttaaggtgg 
ttttgagcgt 
aatagttgag 
ttcgttttta 
'tttattttta 
atagaaagta 
aggagtatta 
tttggttttt 
ggttagtaag 
tttcggaaaa 



tagaaggttt 
aagaagatag 
cggggaagtt 
aatgtttagt 
taatttttat 
agtaggagag 
aaggtagaag 
tagggataag 
aaattgtttt 
tttgtgagga 
tttttttttt 
tttttttttg 
tgttattaa't 
tttaatattg 
ttaaagtata 
tttttttttg 
tagggttaga 
tgattaatgt 
gggtatttta 
aaagtggtta 
attacggtta 
ttttatgttt 
gttaattgag 
tattttagtt 
tggtgtaatc 
gtatttgatt 
attatttatg 



aggaattgtt 

tgagatttta 

gagaggaata 

agatcgttgg 

ttttttttat 

ggtagggagg 

aggaaagtcg 

gtttattaag 

tattgaattt 

gatagttaat 

tttttttttt 

ggattggttt 

aaagggtaat 

attatggaga 

aaaatatata. 

tattggtttt 

gatcggttat 

ttggtaataa 

ggagtatttt 

agaggtggat 

gtgtaattcg 

atattttagg 

ttatttatgg 

ggaaatatat 

ggaggtatgg 

tatatgtatt 

ttttattttg 



ttttgattag 
atttgggtgg 
ttatgtttga 
aaaagtgggg 
.ttgagagagt 
gaagttaaga 
gcgtagtata 
atgagtttgg 
tttgggttaa 
ttttttattt 
ttttttttgt 
agagtatggg 
agtttttttt 
gagttttttt 
tttgtatgat 
aagagttatt 
ttttggtatt 
agtttttatt 
agtattgcgt 
' ttatgtgaac 
aaagtttaaa 
tttttaataa 
aaaggtttgt 
.ttttagaatg 
taatgttttt 
ttagaaggtt- 
taaatttgat. 



60 
120 
180 
240 
300 
360 
• 420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020' 
1080 
1140 
1200 
1260 
1320 
1380 
1440. 
1500 
1560 
1620. 



• ♦ 



atttatatat 
tgtttttttt 
gtaggggtaa 
ttgttttttg. 
tgtgtagtag 
tgtttttcga 
ttatatattg 
tttttgattt 
aggtttgttt 
tggttagtgt 
tgtgtttttt 
taggtttata 
ttgaaaattg 
gtaatgattg 
ttatttgttg 
gtatattgtt 
aaatagttaa 
atttcgttag. 
gtattagagt 
at-tatagac'g 
tatatatata 
aatttagtta 
tagtttaagt 
attttaaaaa 
ttcgtgattt 
gttttggttg 

"gtttttttga 
ttaatataaa 
attttgggag 
ggtaaaatgt 

. ttttttgtta 
cgtatatttt 
tatgtatata 
aatagagata 
tttttggatt 
gattttcgtg 
aagtaaagta 
gagtgatgtt 
ttgtaatgta 
gttggcggag 
cggattcgta 
gttgcgtcgt 
ttttttgttt 
ttttttatat 
agtttttgaa 
tggatagtag 
tcgcggtcgt 
ttgaggttgc 
tttcgagttc 

. acggttagta 
tcggttcgtc 
gttcggtagg 
cgcgggacgc 
ggttttgcgt 
aaacgtattt 
ttaataaaat 
tttaggtaaa 
aaagttgggg 
ttttgtttaa 
agttatgatg 
tttacgtttt 
ttggttgagt 

. taggtgtttc 



ttttgattgg 

tttgtttggg 

aggggttggg 

tgagtagata 

tttagttgcg 

atttttgtta 

agttattcgt 

ttataagttt 

tgattttttt 

ttgtatttta 

ttttttttta 

ttttttaaga 

aaagtgtgat 

ttggttataa 

ataatttata 

taataatggt 

aaatgtaatt. 

atattttgtt 

ttattaggtt 

tatgatatat 

tatatatata 

aaaatttttt - 

.gaatcgagaa 

tttgttagtt 

gaggttatgt 

tataattaat 

attatttttt 

tatataagtt 

tattttgtag 

agtttttttt 

gtttttttag 

attttatttg 

ttacgtattt 

tatcggagtt 

cgtttttcgc 

cgttttcgtt 

aagtttaggg 

taagttaatg 

tatgagttcg 

ggcgttcgtt 

ggttttcggg 

ttttaatttc 

tgcggggata 

taaagtattt 

tcgttcgtag 

taagttcgtc 

cgttaacgcg 

ggcgttcggt 

gttgatgtta 

ggtgttttat 

ggtattttat 

agggagggag 

gcgattcgag 

gtagttcgcg 

tttatttatt 

atacgttgga 

taaatacggg 

gtgtttggag 

tgtgtttttt 

atgatttttt 

ggggttttag 

tcgcgttgga 

gcgagtttaa 



tattttattt 

gtggtagtag 

gttttttttt 

gtaagttttt 

tgtttgtcgg 

ttttacgtat 

atatgcgagt 

atggaatatt 

ttttgtagtt 

gtagggagat 

ggatttgtag 

gttttatgaa 

ttatggaaat 

agggaaatat 

gttattaatg 

ttatttttgt 

tttagtaata 

gggtgtttgg 

gtagtaaagg 

tttatttatt 

tatatgtgtg 

tttatttgtt 

gatcgagttg 

ggattagatc 

ttggtatgaa 

ttgtggaagg 

atatgagaat 

aggttgagag 

aggaagaaat 

atatgtatat 

ttgttaaata 

ttttatttcg 

ttagttaacg 

tggtacgggg 

gtttatttta 

ttttggtcgt 

aagttgtttt 

ttagggtaag 

ggagattagt 

ttgggattgt 

gtagggtcgg 

gggttgtgtt 

cggtttgtat 

gggatggttt 

tttaagattt 

gtgtataatt 

taggtttacg 

tttaacggt't 

ttgtattcgt 

tatttggaga 

aggtattcgc 

ggagggaggg 

ggtgcgcgta 

ttgcgtttag 

tatcgttcgt 

aaattagata 

gcgttttgag 

ttagtagagt 

aaattttttt 

gggaagcgat 

agagcgattg 

gcggattgtt 

aataagttat 



tagtcgtaag 
aaagtatagg 
tttagtatag 
ttttattttt 
gaggggttgt 
aaatatattt 
atattttttt 
tttggaaaga 
ttagtatttt 
gaggattgtt 
gattttttgt 
gtgttttttg 
taaattattt 
ttgcgatgta 
ttaaattcga 
aagaattaga 
gtaagtttgg 
gatagtaatt 
aagtttattt 
ttttgttttt 
tgtgtatgtg 
ttttatttgg 
taggattaaa 
gataatgtaa 
aaggttatat 
tatgaatatt 
ttttaatggg 
aattttagaa 
tgaggttttg 
tttgaatttt 
tagttgtttg 
gttatagtgt 
aggaggggga 
tatataaggt 
agtttagttt 
gaaatttagt 
ttgggatcgt 
gtaatagttt 
atttaaagtt 
atttgttttc 
.ggttagagtt 
ttttttttag 
tttgttcgcg 
tattgtatta 
ttttggagcg 
atttcgaggg 
gttagatcgg 
tggggggtt't 
cgtcgtagt't 
acgagtttag 
gttcgcgtcg 
agaagggaga 
gggagttcgg 
agttaagttt 
gcgagaggta 
aagtagtagt 
ttatttggga 
ttagtagagt 
ttatttagat 
ttttgttatt 
ggagttgaat 
ggtatgtgat 
atggaagtat 



attatgattt 

gtatttttta 

ttttttttgg 

tattgttatt 

taagtgtttt 

atatattttt 

tttttttttt 

cgtttttgat 

gagaaagtaa 

gttttttatg 

gttatttgta 

tatgtgtttt 

gtaaaaaatt 

tttaatgtgt 

ttttggtttt 

taaaatgtat 

tatttagata 

aaaataaagt 

aaaagtataa 

ttaatatgta 

cgtgtgtatg 

atatttgatt 

ggatagatat 

tataattgtt 

tttatattta 

tatgtgcgtt 

attaagtata 

ggttgtggaa. 

gtaggttgta 

cgtttttttt 

tggttggttg 

agtttttttt 

attaaataga 

agtatattag 

tttttgggtt 

ttttatttag 

tttaaatcga 

ttggtcgttt 

ggaggttcgg 

gtcgggtcgt 

cgcgtgtcgg 

gtggttcgtc 

gttacggatt 

gatttaaggg 

gtttttgggc 

cgtcgtttac" 

ttttttttac 

ttttttattt 

gtcgtttttt 

cggttatacg 

ttcgtcgggg 

gtttagggag 

ggcgcgcggt 

tttcgtcggg 

gattcgaaag 

tatttgtggg 

aggtttcgtt 

tttatttatt 

tatttgattg 

cgtttttttt 

gggtttgatt 

ttttgatagt 

aagtgtttaa 



atagtaagtt 

gtttttaagg 

ttgtgttata 

tatttagcgt 

gtttattggt 

tttgtttagt 

attttttcgg 

ttagtagggt 

tttatttttt 

ggggtatgtg 

tataatttgg 

aaaaaggtat 

gttttggaaa" 

ttttaatttt 

agttatattt 

atttgatata 

gattatgaat 

attgatagtt 

attatttaag 

tatatatata 

tttaattttt 

ttgtatattt 

gtagaaatgt 

aaagttttgg 

gttttttgaa 

ttaattaaag 

gtattgtggt 

gggtttattt 

tttttttgat 

tttttagatg 

cgtatgtaat 

agggttattt 

aagagagata 

agaaagtcgg 

atttttagta 

tagcgacgat 

gttgtgtttg 

tttagtattt 

gagtttagga 

tcggttttat 

cgggatatgc 

ggtttttgag 

atgattatga 

aacgagttgg 

gaggtgtatt 

gagtttaacg 

ggtttcgggt 

aatagcgtgt 

ttgtagtttt 

gtgcgcgagg 

tggtcgtcgc 

ttgcgggagt 

ttagttcggg 

tagttgaaaa 

ttcgggtttt 

ggaaaatatt 

tttggtattt 

tttttaatgt 

gaaatatgtt 

ttttttttat 

tcggagttag 

cggaaatttg 

aaataatttt 



1680- 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880- 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380' 

4440 

4500 

4560 

4620 

4680 

4740 

4-800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 



ttgttagttt 

ttagttaggg 

gagattattt 

ataagaattg 

tttgtttttt 

gtgaatattt 

agggcgttat 

ttggtgtttt 

tttttaggga 

attgaagggt 

gagtatttta 

aatttttaaa 

agagaaaaag 

gattaagttt 

gatgtgaatt 

ttagtttttt 

tttggtagat 

gggatcgtat 

tgttttattt 

agagataagg 

tttagtttta 

agaattttgt 

gatagatatt 

aaaggaaagg 

agttttgggg 

ttagtttagt 

aaatattaga 



agtgataagt 

tttgcgtttc 

tgtgatttta 

tttttttttt 

gttttttgtt 

ttttaggtgg 

atggttgggt 

tatatttttt 

tatttatagt 

agattttggg 

tagttacgta 

ggttggtata 

gatttttttt 

aattttatat 

ttataatttt 

tattgtaaat 

tatttattgt 

tttatttaaa 

tatatttttt 

aatttaaaaa 

gaatgatttt 

ttgttattat 

atttagtatt 

attgattggg 

ttgaaagtag 

gagtttagaa 

gtgtatggta 



ttgttttatt 
gtagttattg 
atggcgaagg 
tttttttttt 
tttttgatgg 
ttataggata 
tttgaattag 
tgttttgtaa 
agaaggaagg 
ggtatttagt 
gatttaagtt 
ttttaaatta 
gtttattttt 
aatatttaaa 
atgatggatt 
taatgttgtt 
tttgttaata 
attgtttatt 
gtattatatg 
.ttaaaaaaaa 
ttttgtgttt 
ttagtaagga 
tttttttttt 
"tttttttagg 
tgtttttaaa 
tattttttaa 
tttcgtaagg 



cggggagaat 

tggaaggagc 

ttgtgtgttt 

ttttttattt 

gtttgtagag 

ggtgtatttc 

gtatttttta 

gtcgtggatt 

ggattaaagt 

gtgtattttt 

gtttttattt 

attttatttg 

ttttttttaa 

attgtttgat 

ttagaaatta 

gaattttaat 

attataaagt 

attttagtga 

atatttaaat 

aatttgtata 

tatttttcgg 

aaagaagtat 

tagttttttt 

aaattttaag 

ttgtttgtta 

aaggattaaa 

ataagttttg 



gtttcggagt 
gcggtcggtt 
ttattttaat 
ttttttgttt 
ggattaggtg' 
ggattgggtt 
attgtatatt 
agtttttgtt 
gtagtttggt 
agtcgttttg 
aaaagataaa 
ttttaattta 
atggaagaat 
gtgaaggaag 
tttttttttt 
gggtattaat 
gaatttggtt 
taagtggtat 
atttttagaa 
aatgggattt 
attatttttt 
.ttatgtaagt 
gtttaaatga 
ttttttaagt 
tgatttagag 
atggaaagga 
t 



ggcgtgcggg 

taggatatag 

tttttttttt 

agtttttttt 

ggcgtttttg 

tggaagtttt 

ggtattcgga 

tagtattttg 

tttagaggat 

gagaaattta 

taatgaataa 

gggttaaaat 

aaagtatagc 

gtattggtat 

tatttaattt 

gagattgttt 

aaatatagag 

tagtgtaata 

taataaaaaa 

tgtgtggaaa 

tttttttgtt 

tttttatatg 

ttttgggtat 

agtttttaaa 

ggttatgaat 

atataataga 



5460 
5520 
5580 
5640 
5700 
5760 
5820 • 
5880 
5940 
6000" 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 . 
6600 
6660 
' 6720 
6780 
6840 
6900 
6960 
7001 



<210> 13 
<211> 7001 
<212> DNA 

<213> Artificial Sequence 



<220> * . . 

<223> chemically treated genomic DNA (Homo sapxens) 



<400> 13 

ataaaattta 
ttttagtttt 
atgataagta 
atttaaagtt 
tagaaaaatt 
gatgtttttt 
ttcgggaaat 
ttgtgtaagt 
tgtttgaata 
attattggga 
tattttgtaa 
tattgaaatt 
atggtttttg 
' tattaagtag 
tttaaaaaaa 
ataagtaaaa 
; tgagtaaaga 
tgaagatgta- 
tattttagtt 
tggtttacgg 
ttgggaaatg 
c'ggggtatat 
tttttgtagg 
aaaatgggag 
aggatatata 



tttttacgaa 

tttaaaaaat 

gtttgaaaat 

ttttaaaaga 

gagaggggaa 

ttttttgttg 

aaaatatagg 

ttttttttta 

ttatgtaatg 

tgatgaataa 

ttattagtaa 

tagtagtatt 

gagtttatta 

ttttaaatgt 

aaaaggtggg 

ttaatttgaa 

tagtttggat 

tattgaatat 

tttttttttt 

tttataaggt 

tttaatttag 

ttgttttgtg 

tttattagga 

ggagaggaga 

attttcgtta 



atattatgta 

attttagatt 

.attgttttta 

gtttaattaa 

aaatattaaa 

gataataata 

aaaaattatt 

atttttaagt 

tagaaagtgt 

ttttgaataa 

ggtagtaaat 

aatttgtaat 

taaggttatg 

tgtgtggaat 

tagaagaaat 

atatgttaat 

ttgcgtggtt 

ttttagaatt 

tgttataaat 

agaaagatat 

gatttagtta 

gttatttagg 

aaataaaaaa 

gggagaaaga 

ttgaagttat 



ttttgatatt 

tattaaattg 

gttttaaaat 

tttttttttt 

taatatttgt 

ggtagaattt 

ttaaaattga 

tttttgtttt 

aagatagggt 

gatgcgatt£ 

aatttattaa 

aaaagaattg 

gaatttatat 

tagatttaat 

tttttttttt 

ttttaaaagt 

gtgggatgtt 

tgttttttag 

atttttggaa 

agggatatta 

tgtggcgttt 

aaggtgttta 

taaaaaataa 

ataatttttg 

aaagtggttt 



ttttattata 
agtttatgat 
ttttgagaat- 
tatatttaga 
ttatatgaag 
taataaaaga 
atttttatat 
tttttttatt 
atattatatt 
tttttgtatt 
ggagtagttt 
aaaattaaat 
tatattagtg 
cgttgtattt 
tgttttaatt 
tttgtttatt 
ttaaattttt 
tattttttgg 
atagaatatt 
gttcggatat- 
tgaagttttt 
ttagaagcgt 
aaggagaaat 
tagggaaaaa 
. tttgtgtttt 



ttttttttta 

tttttgggtt 

tatttaagaa 

gttatttaag 

aatttgtata 

ggaaagataa 

agagttttat 

attttaagag 

gatattattt 

tgattaggtt 

tattagtgtt ■ 

agggaagaaa 

ttttttttta 

tgttttttta 

ttaaattaaa 

gtttgttttt 

ttaaggcggt 

ggttaaattg 

gaataaaaat 

tagtgtatag 

aaatttagtt 

ttatttaatt 

tgggtaagag 

aattaaaatg 

ggatcggtcg 



60 
120 
180.- 
240 
300 
360 
420 
• 480 
540 
600 
660 
720 
780" 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
i440 
1500 



- iy - • • • • 



cgtttttttt 
tatttttttc 
^tgtgttttta 
agttatatgt 
tatttaattt 
gggtgatagg 
agtttagatg 
aattttgttg 
tt'tttaagtg 
aattgttgtt 
ttgtttttcg 
agaatttgat 
ttcgggtttt 
tttttttttt 
gcggcgcggg 
gttgggttcg 
"tagttgcggc 
gaaatttttt 
gtcggtttga 
gttttcgggg 
tcgttttagg 
ttgatgtagt 
tcgcgggtag 
tttggaaaaa 
gagttttggt 
cgggagtaag 
taattttaag 
gggattgttg 
agcgatttta 
ggttgagttt 
• aagattgggt 
.tgttttatgt 
tttttttttt 
tatattgtaa 
atagatagtt 
ggagatttag 
ttaggatttt 
tttttgagat 
ttttattagg 
gggtgtttat 
aatgtgattt 
gttatattgt 
ttttagtttt 
tttagatgaa 
gtatatatat 
aggagataaa 
tgaataaatt 
taattgttat' 
gttaagttta 
atttaatttt 
atcgagttta 
gtgta'tcgta 
tagatgattt 
gtaaaaagta 
tataaagaat 
. tagtaatttt' 
taaaatgtta 
gttttt'ttag 
ggaagga'atg 
tggatgtgtt 
ggtagttttt 
ggggagtgag 
gttgtattgg 



atagtggttg 
gggtgggata 
tatggtttgt 
tagtaattcg 
ttaatcgttt 
aatcgttttt 
aaaggaaatt 
agttttgtta 
atttaaagcg 
ttgtttggtt 
tacggacggt 
tttgaacgta 
ttgcgcgtat 
tttttttttt 
cgcgggtatt 
ttttttaggt 
ggcgggtgta 
aggtcgttgg 
tcgtagattt 
tagttgtata 
gggattttga 
agggttattt- 
ggtgtagatc 
gagtatagtt 
ttcggttttg 
tgtagtttta 
tattggtttt 
ttttgttttg 
aagagtagtt 
tacggttagg 
ttaaaataaa 
gtttcgtgtt 
tcgttggtta 
tcgagatagg 
atatttagta 
ggtatatata 
agtttttttt 
ttttttagtt 
aatttttatg 
gttttttata 
ttttatatta 
cggtttggtt 
ataattcgat 
aaataaatag 
atatatatat 
aaataggtga 
ttttttgttg 
tttaaatatt 
ttgttattaa 
tataaaaata 
gtattaatga 
gatatttttt 
aatttttata 
ttttataggg 
tttataagtt 
tatttttttg 
aagttataga 
aaatgtttta 
tgttcgtatg 
tgtgcgtgga 
ttcggtagat 
gggagat'ttg 
ggaggagaaa 



cgaggcgtag 
ggtttgttat 
tttaaattcg 
ttttagcgcg 
tttaaagttt 
tagaaagtta 
tggggagtat 
attttaaata 
tttcgtgttt 
ttttaacgtg 
aagtgggtgg 
gcgcgggttg 
tttcgggtcg 
tttttttttt 
tgtagaatgt 
agtagggtat 
gtagtattag 
agtcgaacgt 
gcgcgttggc 
cggcgggttt 
gttgcggacg 
tagatgtttt 
gtgttttcgt 
cgaggttaga 
tttcgggagt 
ggacgaacgt 
tcgagtttat 
atattggttt 
tttttgaatt 
gggcgggggc 
cgcgaaagac 
agatttcgat 
gaaatacgta 
gtagatagaa 
gttgggggaa 
taggaagagt 
tttgtaaaat- 
tgatttgtgt 
tgaaggatga 
ggttggttat 
aatataattt 
tagttaatag 
tttttcggtt 
aaaaaagttt 
atgtgtatat 
agtatattat 
tagtttaata 
taatagagta 
gagttatatt 
gattattgtt 
ttatagattg 
tttatggtta 
aattatattt 
tttttaaaaa 
ttgaaagaaa 
ttaggatata 
gagagaaatt 
tgggtttgta 
tgcgagtggt 
atggtaggga 
acgtagttgg 
ttgtttgttt 
ttttagtttt 



attttggttg 
tgggttggta 
cgggatattt 
gatttagtta 
taggacgtgg 
ttattatagt 
attaaataaa 



tttttaattt 
atttgtttgg 
tgttttgtta 
agagtacgtt 
tacgtagaat 
cgcgtttcgc" 
ttttgtcggg 
cggcgggtcg 
ttgttggtcg 
cgggttcgga 
cgtagtttta 
ggcggtcgcg 
gttgttgttt 
gtttaggggt 
ggtgtggagg 
agggtagaag 
ggcgacgtag 
ttgcgggttc 
ttttcgttag 
atgtattata 
aaatattatt 
ttattttatt 
gtacgaggat 
ggatttaggg 
atatttttgt 
gtgtgtatat 
tggggtgtgc 
ttgatagggg 
tgtattttgt 
gtttttaaag 
gtttatgttg 
tttagaaaaa 
gtaattaaaa 
taggttacga 
atttttaaaa. 
tatttgggtt 
ttaattgaat 
atatatatat 
gcgtttataa 
gatttt.gata 
tttgacgaag 
tttgattatt 
ggataatatg 
ttagtaaata 
gtaattatta 
ttaattttta 
atgtgaattt 
aaggagatat 
gatattagtt 
aaaataagtt 
gaagttaagg 
ttagtgtgtg 
ttcgggaagt 
gttattgtat 
atagggagta 
•tttgttttta 



attcgtacgt 
ggagattatt 
ataaattttc 
gttaatttcg 
ggtggggagg 
tgatatattt 
aatattaaaa 
taaatgttaa 
aggtgttttt 
ggaagttcgg 
ttttttagtt 
tttcgggttg 
ggttttcgta 
cgcggcggtt 
gtttcgcgta 
tggggttgta 
gatacgttgt 
gattcggggt 
gcgttgaatt 
aggtatattt 
tttagttcgt 
gttatggtta 
gtttagaaat 
cgtatgtttc 
ggtgaagtcg 
tttttgggtt 
aaggtgttgg 
ttaggtataa 
tgtcgtcgtt 
ttgttaaagg 
gtcggttttt 
ttgttttttt 
aggatgattt 
ggttgtatac 
gtatttgagg 
tattaggaga 
tagatagatt 
gattatagta 
tttttggtta 
ttttagaaaa 
attaaagttt 
.tgtatttttg 
aggatatgta 
taaaagttaa 
atatatatat 
ttttggatag 
taattattaa 
tgtttatggt 
ttatattaag 
taaatgtagt 
aagggttaaa 
ttttttaaag 
aatgtttttt 
gttaaattat 
atatatattt 
agaaaggtaa 
tattttgttg 
gtcgagagag 
aattaggtag 
agttagtagg 
agcgttggat 
gtgtggtata 
tttttggagg 



tatttcgggg 
tttaagtatt 
ggttgttaga 
aaattagatt 
aggggaaagc 
ttaattaaat 
ggataaataa 
gagcgagatt 
ttttataaat 
gttttcgggt 
gttcggcgag 
ggtcgcgcgt 
gttttttagg 
atttcgacgg 
tcgtgtagtc 
ggaaaggcga • 
tgagtggggg 
cgtaggggag 
cgtaggcggc 
cgtttagggg 
ttttttggat 
tggttcgtgg 
cggcgggtta 
gtcgatacgc 
ggcgattcga 
ttcgggtttt 
aggacggtta 
ttcgatttgg 
gttggataga 
tggtttaggg 
tttaatgtgt 
ttttgtttga 
tggggaggat 
gtagttagtt 
ggaagggggc 
atgtaatttg 
tttttataat 
ttgtatttgg 
gggcgtatat 
ttgaatataa 
tggtaattat 
tatgtttatt 
gaattaaata 
atatgtatac 
atatatatta 
tttatatttt 
tattttgttt 
ttatttaaat 
tatatatttt 
tgaagttaaa 
aatatattag 
taatttttta 
taaaatatat 
atgtaaatgg 
ttatggagaa 
gttgtttttt 
gattaagaac 
tgagaaggaa 
agagagtgtg 
tagggtattt 
gaatggtagt 
gttagagaaa 
ttggaaagta 



1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040. 

2100 

2160 

2220 

2280 

2340 

2400 . 

2460 

2520 

2580 

2640 

2700 

•2760 

2820- 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 ' 

3720 

3780 

3840 

. 3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800' 
4860 
4920 . 

' 4980- 
5040 
5100 
5160 
5220 
5280 



• • 
• • • 



ttttatgttt 
ttttacggtt 
atatgggtgg 
ggattaaatg' 
cgattgtatt 
tagttgaaat 
gtttagttga 
tgaatataag 
ttgatcgtga 
ttgat.tattt 
ttagagtgtt 
gatattagtt 
ttttggtttt 
ataaaaaaga 
ttgtgttttg 
. ttagtattga 
tgttggtgat 
ttagaaaaaa 
aaggaaaggg 
ttttttatag 
aaaggtagtt 
ttttgttttt 
ttttttgttt 
ttttttttgt 
ggtaaagatt 
tgttgggtat 
taattttttc 
attgtttttt 
taagtttttt 



tttgttgtta 
aaaatagaat 
ttttttcgaa 
ttttattggt 
aggaaattag 
gtaatatttt 
ttgttttttg 
ataaaggtag 
ttgggaacga 
ttttaattat 
tacgtttagg 
attattttgg 
gggatattat 
aatagaaagt 
aaagattttt 
aaataaatat 
agtggtgttt 
gtaagagtat 
gatttgtaat 
gttttgattg 
tttttaatta 
gggtttggtg 
ttggtaaaaa 
tggatatata 
gatttggttt 
tgtaggtttg 
gtgtttgtgt 
tcgagggttg 
agttttataa 



ttttaagtaa 
gttagttaaa 
agaatataat 
tttagttttt 
attttggaat 
tttatttcgt 
taaatatgtg 
atatttttgt 
ggttttttat 
tttttcggtt 
gttttaggat 
atattaaatg 
gtagttattg 
tataaaaatg 
agtattgtag 
tggatataaa 
tttgtaggtt 
ttagggaggg 
tgtttattat 
tttagagatt 
tttttttttt 
agtatgggtg 
ttttattcgg 
cgttttagtc 
tttcgttgaa 
tttaaatatg 
ttagtattat 
attttttttt 
tttttttttt 



gaggaaaaat 
agtgtatgga 
ttaataataa 
agtttttgaa 
aaatgttttg 
taattaaatt 
aaaaatacgt 
tttaaattga 
tttttagtgg 
atagtattaa 
atatgatatt 
attttaatta 
agagatttat 
ttaatgatgt 
tgtttgagta 
taaatattga 
aataatggtt 
aggagagagg 
tgatatagga 
tataattaaa 
agtatttttt 
ggaaagtata 
gtttttagtt 
gtgatttatt 
tgtgttttta 
agttattatt 
tgttttttta 
gttttttata 
tttattgtat 



aggtttgttg 
tattaagttt 
aagttttttg 
tagaggtatt 
gtattttagg 
tataaatttt 
attattaaaa 
ttttaggttt 
cgtttatatg 
tacgtaatat 
taatggaggt 
taatgttagg 
gagtggtttt 
tattatgtaa 
taggagagtt 
aaagaaagat 
atttatgttt 
aataggggaa 
agaataagaa 
gttagtttaa 
tttaaattta 
ttgtgttacg 
tttttggtt't 
ggttttttta 
gttttatttt 
tttaaatatg 
gttatttaga 
tttaattaag 
t 



tgaattatag 
ataaaatagg 
ggatatatgt 
gttatgtttt 
gatgtgtttt 
tttatgaata 
gtttaggata 
tcgagttgta 
gatttatttt 
tgaggtgttt 
tttgttgtta 
agtggtcggt 
tgagattagt 
atatatgttt 
ttttttatag 
tgttattttt 
tagattagtt 
aggagaagga 
ggttagttgt 
gaagtttagt 
-ttttggtgag 
tcgatttttt 
ttttttttat 
ggtgaagaag 
tttagcggtt 
gtgttttttt 
ttaaaatttt 
aggtaatttt 



<210> 14 
.<211> 11001 • 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 14 



5340 
5400 
5460 
•5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7001 



ttaagttaga 

attttatagg 

ttttttatgt 

taaagtatta 

gtttggaagt. 

ttgtttttta" 

ttagtgataa 

tttttaggat 

gttgtttatt 

tagggttagt 

tagggtgggg- 

ttgcgggggt 

tttgtgcgtg 

cggcgtttga 

tttttgttgg 

gtgtaagaat 

ggtgagatgt 

tttaagataa 

tttttacgtt 

ttgggtattg 

ttattttata 

tagggttttt 

tttgtttttg 



tgttttttaa 
tttaatattt 
tattagatgg 
aggtaatttt 
ttttgtttta 
gtataatttt 
ttaagaaatt 
aaagtttagg 
atttaatagt 
tgtttagttt 
tgggaagttt 
tatttattat 
atagtatatg 
tatttgtgtt 
agggtataga 
cgtttttgtt 
tagtttattt 
ttttag'ttgt 
ttttttttga 
tattttttga 
ggttcgtttt 
ttgaaaaatg 
atgtatatat 



ttatttgtgg 

tggagttaga 

atttttgtat 

ttttggtttt 

gttataatgt 

ttttttaagt 

aagaattatg 

gttttgttat 

tttcgtaagt 

ttgtatgttt 

attttggtcg 

taagaatttt 

taggtgtttg 

tcgttttggg 

gatagggcgt . 

ttaaagaaat 

ggggttaggt 

gattttttga 

gtagttaatt 

ttttttgttt 

gtaagatggg 

gtttttgggg 

atatagttgg 



ataggttagg 

aagtttttag 

ttttagaaga 

ttgggataat 

ttatatattt 

tttattgtag 

tatttacgtt 

tgggttttgt 

atattgtgaa 

aggtgtatgc 

tgacgtttag 

cgtagtaggg 

atagtatttt 

tttcgtcggg 

attagtttta 

agagaaatta- 

tgggtatgta 

gtattaggta 

tattaagttt" 

aattttagtt 

ttagtatgga 

tagtttttgg 

tgtttatttt 



tatattttga 
gtaaaaagtt 
ttttttatat 
tttaggtttt 
ttggaattgt 
ttatagttta 
tattttttta 
taggagtcgt 
ggggaagtaa 
gtatatattt 
cgcgtttaaa 
ttttaatgat 
tgggtaggta 
tattttgtaa 
gttgaatttt 
aattttg^ta 
taaattattg 
tatagttggg 
atgaagaggt 
tgagaaaggt 
tatagggttt 
aaattgattg 
gaatttatta 



gtttaatttt 
tgaagggggt 
taggaaagat 
ggtatgagtg 
tttgtagggt 
ttagttttgt 
gagttatttt 
agtcgtaggg 
tgattagaga 
ttatataggg 
gagtgtaaat 
tttacgtgtt 
aaaggaagtg 
ggagggtggt 
gatgaagttt 

. ataggtttta 
tfctgcgttta 
tatttgtttt 

'tgttgttgat 
taggtgtttt 
ttgaggaatt 
tttttggttt 
ttgtttttgg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
• 600 
660 
720 
780 
840 
900. 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



- 
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• • • mm. 
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ttttgtatgt 
gtttagggtt 
ggtatatggt 
taaaattagg 
taggaggatt 
atttttatta 
aattgtttag 
agttattatc 
-ataaaaaata 
ggtcgaggcg 
aaatttcgtt 
ttagttattt 
tgaattgaga 
aaaaaaaaaa 
tgagttggag 
cggggttatg 
ttattagtgg 
aattattgtg 
ggttttaatt 
ttgtttggag 
tttaggttgt 
agcgattttt 
attcggttaa 
tttgaatttt 
ggcgtgaatt 
tttagggatt 
atcgattttt 
gttagatagg 
aatattttgt 
taggagagtt 
tgatggggat 
.tttttattta 
gttagggttt 
ttttaagtat 
aattttattg 
tggaaaatga 
cgggaagaag 
agtgttaggg 
gttaatttgt 
tttttttttt 
tttttttttt 
tattgtatag 
tgtttgttgg 
tttttttttt 
gttttttttt 
ttggtttatt 
tagttgggat 
acggggtttt 
ttcggttttt 
tgtttttttt 
tttttttttt 
tttagtttcg 
ttttttagta 
ttagtataga 
gatttgttt'g 
tttttttttt 
acgattttgg 
ttgggattat 
ggttttatta 
tggtttttta 
ttttgagata 
attgtaattt 
ggattatagg 



tttgggtgga 

ttagtattga 

taggggttag • 

ttgggtgtgg 

atttgatttt 

aaaaaagaaa 

gaggttgagg 

gtgttattgt 

aaaataggtt 

ggtggattat 

tttattaaaa 

aggaggttga 

tcgtgttatc 

aattagtaaa 

tgagggtttg 

atttattgta 

atagagtagt 

atatgatgtt 

tgttagaggt 

atttggttta 

agtgtagtgg' 

ttgttttaga 

tttttgtatt 

tgattttaag" 

atcgtattcg 

ggatttttgt. 

tttttttaga 

ttttagttag 

tttagaagaa 

agggtttttt 

aaatggagag 

gaattcgttg 

tataggaaaa 

ttaaaggttt 

gggatagtgg 

gggttggagg 

tatttagatg 

taagaaggaa 

ttttgtgatg 

tggatgattt 

tggtgatgaa 

tttttttagg 

aagaggattt 

gtattatagg 

tgagatagag 

gtaatttttg 

tataggtgtt 

attattttgg 

taaagtgttg 

ttttttttta 

ttttttttga 

atttattgta 

gttgggatta 

cggggttttg 

ttttggtttt 

tttttttgag 

tttattgtaa' 

agttatatat 

tgttggttag 

aagtattggt 

gagttttatt 

ttgtttttta 

tgtgtatttt 



taagggaaag 

atgtagtttt 

gaatagttga 

tggtttatat 

taggagtttg 

aaaaaaaatt 

tgggaggatc 

attttagttt 

gggtacgttg 

ttgaggttag 

atataaaatt 

ggtaggagaa 

gtattttagt 

attatatttt 

ttttggagaa 

gggtgagagg 

atttggagtt 

taattttgat 

atataggtag 

aagatttttt 

ttggattttg 

ttttcgagta 

tttagtagag 

tgattcgttt 

atttagagat 

cggaagggtg 

tttttttggt 

aatgcgagtg 

tttttataag 

ttgatttcgt 

gatagatgag 

ttattttgta 

aggtaaaggg 

tttggatgga 

gtaattttgt 

ttgtgaataa 

ggaggagtat 

gattattttg 

gttttgagtt 

aggagttttg 

tttttttatt 

attttaaagg 

taatagttat 

gatttatttt 

tttcgttttg 

ttttttgggt 

tgttattata 

ttaggttggt 

ggattaaagg 

atatataatt 

gatagagttt 

atttttgttt 

taggtatata 

ttatgttggt 

ttaaaatgtt 

atagagtttt 

tttttgtttt 

tattatgttc 

gttgatttcg 

attagaggt'g 

ttgttattta 

ggtttaagtg 

cgtggttagt 



atagaattat 
aaggatatta 
gggattttga 
ttgtaatttt 
agattagttt 
agttaggtgt 
gtttgagttc 
ggggaattga 
gtttacgttt 
gagtttgaga 
agttaggcgt 
ttgtttaaat 
ttgggtaata 
aattgtatat 
ggtagtttat 
agtggagagt 
agttttttat 
ttaagttgta 
ttttttggtg 
ttttttttga 
gtttattgta 
gttgggatta 
atgggatttt 
gtttttattt 
ttttaattcg 
gagtgtggga 
tttattgtat 
ttatagatat 
gttttttgta 
tttggagttt 
ggagtagggt 
gttagtaagg 
ggaggggtgg 
gaagagtatt 
cgggttagta 
agtagtggat 
taggggtagg 
tttgtagaat 
tttatttaat 
ggttgggatg 
tttatttaga 
aggaagagtt 
tttagtcgga 
ttattttatt 
ttttttaggt 
ttaagtaatt 
'attggttaat 
tttgaattt't 
tgtgagttat 
tgggattttt 
tgttttttta 
tttgggttta 
tt^ttatggt 
ttggtaggtt 
gagattatag 
attttgttat 
tcggttgaag 
' ggttaatttt 
aatttttgat 
tgagttatcg 
ggttggagtg 
atttttttgt 
tttttttttt 



ttggtttttt 

tagaagtagg 

agagggattt 

agtattttgg 

gggtaatata 

ggtggtgtgt 

gggagattgt 

gtgagatttt 

gtaattttag 

ttagtttgat 

ggtggtatat 

tcgggaggtg 

agagcgaaat 

tttgattata 

tttttttttt 

ggtgtatatt 

gttttatata 

taaaaggtag* 

ggtttttgta 

gacgaagttt 

agttttgttt 

taggcgcgtg 

attatattgg 

tttaaagtgt 

attatttatt 

tagggtagtt 

tagttttatt 

agttaagttt 

gaatgatttt 

ttttaagtat 

ggagcgtttt 

atgtggggtt 

gaatttaagt 

ggagtaaaaa 

aaaataaatg 

gtgtttgttt 

agaaatgtta 

agggagggta 

aatgagaaag 

tagtgatttt 

aaatagattt 

tttttttttg 

tgtatagtag 

attgtttata 

tggagtgtag 

ttttgtttta 

tttttgtatt 

tgattttatg 

tgtatttggt' 

tagtttttta 

tttaggttgg 

agtgatattt 

tagataattt 

tcgaattttt 

gtatgagtta 

ttaggttgga 

tgatttagtt 

tgtatgttta 

tgtaaatgat 

tatttggttt 

tagtggtacg 

tttatttttt 

aattggttag 



tttgttgttt 14 4 0 

ggtaattgaa 1500 

ttatttaaag 1560 

gaggttaagg " 1620 . 

gtaagatttt 1680 

ttgtagtttt 1740 

agttatagta 1800 

gttttaaaat 18 60 

tattttggga 1920' 

taatatggag ' 1980 

gtttgtaatt 2040 

gaggttgtag . 2100' 

tcggttttaa 2160 ■ 

gtattttagt 2220 

tgtttcggta 2280 

agtagtttag 2340. 

tagtgagaaa 2400 

ttttaggtta 24 60 

tttgtttgtg 2520 

tattttgtcg 2580 

tttgggttta 2640 

ttataataat 2700 

ttaggttggt 27 60 

tgggattata 2820 

ttttatttta 2880 

agggttttga 2940- 

ttttgttgac 3000 

agcgttgatt 3060 

gtgtttagtt 3120 

ttaaattatt 3180 

agtagaatgt 3240 

aagaattaag 3300 

ttattttttt 3360 

ttttagtata 3420- 

gtgtgggttt 34 80 

agtatattaa 3540 

gatagatttt 3600 

tagggatggt 3660 

tttgtttttt 3720 

atttttagtt 3780 

ggattagagg. •. 3840 

tttttaaagt ' 3900 

gattatggaa 3960 

aaaattgatg* 4 020 

tggtgcgatt 4 080 

gttttttaag 4140 

tttagtcgag 4200 

atttatttat 4260 

ttaaaattga 4320 

tttttttttt 4380 

aatgtagtgg 4440 

ttgttttagt 4500 

ttttgtattt - 4560 

ggttttaagt 4 620 

ttaagtttag 4 680 

"gtgtagtggt 4740 

ttttaagtag 4 800 

gtagagatag 4860 

ttatttgttt 4920 

ttttttttat 4980 

attttggttt" 5040 

taagtagttg 5100 

tgttttgtgg.' 5160 



t 



tttttttatt 
ggggtttata 
agtattattt 
tatttttttt 
agtttttgtt 
atggtatttg 
atagttttgg 
gttttttttt 
ttttttattt 
ttatatttta 
tatataggag 
tattttttaa 
gagggagtag 
tttagttata 
tcgtgtttaa 
tggaatcggg 
ttacgtagtt 
ggcggagttt 
tttcggcgcg 
ttcgagtttt 
gaaggcgttt 
agtcgttaat 
ttcgggttag 
tttgttttat 
attttatcgt 
gttagcgttc 
cgggaagtgc 
ggagtatttg 
ttttgtgtcg 
ttcgagaatg 
gcgtttatta 
taatagttta 
gagaggtttg 
ggaagtttta 
ggttatttta 
cggaaggtta 
tttttttttt 
ttatttaggt 
ttaagcgatt 
gtttagttaa 
tttaaatttt 
ggtttgagtt 
tatttgaagg 
ttattttttt 
ttttttattt 
tttagcgatt 
tagtattatt 
tttatttttt 
acgatgaagt 
agatttacgg 
tattaggatg 
tagttttatt 
tagttattgg 
taaatatgtt 
cgttgtaatg 
attttgtcgt 
ttgtgtttaa 
tattacgttt 
taggttggtt 
gggattatag 
attttgtgtt 
atgtatatat 
•atgtttaatt 



tttttatagt 
tttaatttaa" 
attttgtaag 
atttaaataa 
tttttttttt 
gtatatagtt 
gttagtaatt 
tttttgtggg 
atttttaaat 
gtagttggtg 
tgtagatgga 
tttttggcga 
gagtattcga 
gttggataat 
tttgtttgtg 
gttttacgta 
aggagggtgg 
tagcgatcgt 
gtttgcgagt 
tttggttgta 
tcgtttcgtt 
tatcgttfccg 
agtcgcgtcg 
-tggggtagga 
ttcggagttt 
gggtaagcgt 
.gcgtgcgcgg 
cggagtcggt 
tcgggaaatc 
tttgtattgt 
cgttcgtgtt 
ggtagacgag 
gcggtgaggc 
gggagagggt 
tattgttacg 
taggtagaga 
ttttattggt 
tggagtgtaa 
tttttgtttt 
tttttgtatt 
tgattttagg 
atcgtattcg 
aaaaattaaa 
tagattaaga 
attttagtta 
tgtgtattat 
tgaagtttgt 
gtagtttttg 
ttgggtttag 
ttgaaatatt 
tttaaaagac 
gttatatttg 
agataaaagt 
taaattattt 
ttttagtttt 
taaggttgga 
gtaatttttt 
ggtttatttt 
ttgaattttt 
gtatgagtta 
ttttttagtt 
agatttggga 
tagattttat 



ggaaaatggt 

tttttgtatt 

gaattttgta 

ggattttcgt 

ggtgtttagg 

tttcgataat 

tggatttttt 

taaagtgttt 

ttttttttag 

ataggttaga 

ggagtaggat 

ggattttttt 

acgcggaaat 

gttcggttta 

atagaaatta 

cggtttcgcg 

ggttgtttat 

tttttttttc 

tagcgaggtt 

gtagttattg 

tatagtcgta 

gtcgcgtgcg 

taatcgttaa' 

acgtcggggc 

tatttataga 

ataagagtgg 

agtttgggtt 

aatttaggtt 

ggtaattaga 

ggtttgttta 

tcgttttatc' 

cgcgcggtag 

ggaggagttt 

cgttgttcgt 

cggattttaa 

ggtttgttta 

aattaaaaaa 

tggcgcgatt 

agttttttga 

tttagtagag 

tgatttattc 

gttttttatt 

tatttttaat 

tattggtttt 

ggtttgtttt 

taaatggaat 

tttttttgat 

tttagttttt 

ttttgattat 

gatttaaatt 

gttttaagtg 

tttagttttt 

ttaaatttat 

tgttttttat 

gttttttttt 

gggtagtggt 

tattttagtt 

tttttatttt 

gattttaggt 

tcgcgttcgg 

gaatttgtga 

tattatttat 

gttataataa 



ttaggattga 

taagttttgg 

aaattttatt 

tttatttttg* 

tttggttttt 

ggggatattt 

tttttatttt 

ttgaatgttt 

atttatttta 

atagggaaga 

ttttttttgt 

ttttttatat 

cgaggtgtta 

ggtttattat 

agggggtttc 

ggtaggtttt 

cgatttagga 

gttcgtcggt 

cgcgcggtga 

ttcgtcgtgt 

ttcgttcgtt 

tgcgtgtacg 

gattgaaacg 

ggggatacgt 

ttttatttat 

tttttggtcg' 

agtttgggtt 

ttttttttag 

acgttttttg 

ttttcgtttt 

gcgggttttt 

cggtagtggt 

taggtcgggg 

taggtttcgt 

tgttggttat 

atagttggat 

ataataataa 

tcggtttatc 

gtatttagga 

acggggtttt 

gtttcggttt 

gggaacgtat 

ttacgtttgt 

tatatatttt 

tttttttagg 

ttagtgtttt 

ttttaggggt 

ttgtagattt 

tggtttggtt 

tttagattag 

aatatggtta 

tttttgtagt 

ttttgatttt 

ttttatggtt 

tttttttttt 

acgatttcgg 

tttcgagtag. 

ttagtagata 

gatttattcg 

t'tttttgttt 

tgtttttttg 

ttataagttt 

tttggggagt 



ttgatatgaa 
gttaagattt 
tgaagtatta 
ttaggtatat 
tgagtttggt 
aggaggtttc 
tttaggtatt 
atggtttaaa 
gaggaaggga 
ggtgagggtt 
tttttaagtt 
ttttttttag 
gtttaaattg 
aagttatata 
ggtatttagt 
cggttaggat 
cgcggtaacg 
attttttggt 
agtattgttc 
tgttttaggt 
ttttagtttt 
cgtgttagtg 
tagatcgtcg 
acgtttcgtt 
tatagggaac 
gaggcgaggg 
cgggttcgtt 
ttttcgcgta 
cgcgcggtat 
ttttatacgt 
agtttaggtt 
aggtgaattt 
aaatgtttcg 
aggttcgatt 
ttgggcgttt 
ttttatcgtt 
aaaattgcgt 
gtaattttcg 
ttataggcgt 
attatgttag 
tttaaagtgt 
atggaatata 
tttgtggttg 
aatttttcgt 
aaattgttcg 
attttttatt 
tatatttttt 
tgatttaatt 
tgtttatatt 
attttt.cgtg 
aaatttaatt 
aaaaatggtt 
ttttttgttt 
attttatttt 
tttttttgag 
tttattgtaa 
ttgggattat 
tgaggtttta 
ttttcgtttt 
attttttgta 
tatcggatga 
ttaaataggt 
ttttaaaatt 



nat aanttt a 


5220 


f- rrnnnt nt t n 


5280 

<J G* U W 


tttataattt 

L L Lu LOO L U L 


5340 




nn 

JiUU 


erat-tatatt-a 


5460 

«s *a U W 


aacratatttt 


5520 


tfal-aal-i-t- a 


5580 


ataaCjaLCtt 


0 DfJ U 


A ta rr^ a tt tt 


5700 


t ant t n at tt 
Lay l Lyy l l l 


5760 


....... 

LULL LaaaLa 


5820 


4. 4- 4- *-4- 4- 4- aarr 
LLUULLLaay 


5880 


tt f^nnt cnnt 
l L^y y uuyy l 


5940 


n-t-i-rr-hi-'hl-M- 

LjLLLJLH.I>.Lt 


6000 


attta rrrr r*rrrr 
aLLiayy y 


6060 


LL.yL^yyyyciy 


6120 


na t c nnnna rr 


6180 


f-ht-t-a-hi-tcrcr • 
LLLioitiyy 


6240 


nantttcna rr 
y ay l u uuy ay 


6300 


Let l l cuyaaa 


6360 


n r^rxr^rxt t nt 
y L>y uy uuwy l 


6420 


Ly oy uy tyuy 


6480 


rr rr a a rrt t 
yyaut Lay l l 


654 0 


f ffa rrrr a a 1 rr 
ill a.y y a. at i_ y 


6600 


rr rr rr rr rr r* rr rr rr t 

yyyyy c yyy u 


6660 


r*rrrrrfa a rrrf "H rr 
^yy yaoy y l y 


6720 


Ly Lay uyyy l 


6780 


naa t" 1 ant" 1 1 

uauu LQy L L L 


6840 


+■ 1* a rr at a cr 1 1 

l l. q y y l uy u. \- 


6900 


tttcrtcit t't r* 
l l i»uy y i-llu 


6960 


4- r'rrrTrrrr't* "h r* rr 
tcy y y y t-iuy 


7020 


rrl-aa t" i"t"ot" A 


7080 


rra rrpq"h f" rraacf 
y ay a L uy uoy 


7140 


tatt-ttacrta 

UuLLU Lay L- y 


7200 


rrrra aaf cctnt 
yycLdd LL>yy l 


7260 


+■ t\ rrt t* a rr a a 
Lay La Layaa 


7320 


tttattttta 

LLLMLLLLLy 


7380 


LLLLLLyyy l 


7440 


-rrert-tatt-at 

l l> y L l a L L a L 


7500 


ttactrtttctctt 
l Lay y l l y y l 


7560 


4- /-» f-r af f 

tggg.attaid 


7 620 


LLLLJLLLCILL 


7680 


4- 4- = 4- 4- 4- rr-i- 4- +- 
LLaLLLyLLL 


77 4 0 


ttttattttt 

LLLLdLLLLL 


7800 


ggataggguu 


7fl fiO 


+■ -r -h 3 1- 1- 1 1 1 1 

LLLdLLLLLL 


7920 


Lay L L L L L L L 


• 7980 


4.4.4- a 4- a f f f 4- 

LLLaLaLLLL 


R040 

O U *i v 


LdLLLyLLLL 


8100 


tt arrtat-r-rf- 

LLayLaLLLL 


8160 


LLLLLLLLLL- 


8220 


a tt art rfh t-f-h 
aLiayy Lin 


8280 


■h1-ai--r-ht1- era 

LLaLLLLLya 


8340 


tt 1 1 1 n a (i a a 
LLLLLyayaa 


8400 


^l-arrarrt* t*1- 1* 
a Lay ay lull 


84 60 


L L L LL»y L L L L 


8520 


s rtnt at t f^nt 
ay y l a l l L»y l 


8580 


L La Ly l Ly y l 


8640 


"t~r«rra ant at t 
l l> y a a y Ly l l 


8700 


ttttattata 


8760 


gagggttttt 


8820 


tagagtagtg 


8880 


tattgatgtt 


8940 



• •o m 



tagggtttat 
ggattgtaaa 
gtttgttggc 
ttagtatttg 
tttgtttgta 
taattttttt 



ttgggaagat 
ttttaaagat 
atttgtggtt 
agcgatgggg 
tcgcgatttg 
gcgcggggtt 
tcgttataga 
cggttaggga 
ttggtacgtt 
" agtttaatga 
gggcgttagt 
tttaggattt 
gtgtagggat 
tttgtttttt 
cggcgatcgt 
tagaggaaga 
atatttggtt 
gagggagatc 
tcggtagagg 
gtatgggtgg 
taggatataa 
tgggtattga 
ttttaggata 
aagtttttgt 
ttagtttgta 
ggtgggggta 
ttataggttt 
ttggagttgg 
ttagattttt 



ttttagtagt 
agtatttttg 
gatttttagg 
atttattggg 
gggggtttat 
ttattttaaa 
ggatttcgag 
aagaaattta 
tagatacgag 
cgaaagtaac- 
attcgcggcg 
attatttacg 
ggtcgtatta 
ttagttgtcg 
ttagtcgggt 
gtttaggttt 
aagcgcgggt 
agattgggcg 
tgtggggttc 
tttttttttt 
aaagtattac 
gggtttttat 
tatttattgg 
ggttagtagt 
ttggatcgtg 
tcggttttta 
gtttttttat 
ggatggtaat 
tttttatttg 
attgtagtaa 
ggttttagtt 
ttttatttat 
tattttcgcg 
tgtttggtaa 
gggatttagg 



tgatttaata 
gtgattttag 
ggtgtagaag 
ttagaaaagg 
taaaggtttt 
ggttttggag 
gatagtaggt 
aaattagcgt 
tggagtgcga 
ggatttagtt 
attgcgttgt 
cgcgtattgt 
cgtgtgcgtg 
tgtggggttg 
gcggtttttt 
tttcgatatg 
cgggcggggt 
gcgggaagga 
gaaggcgggg 
. tcgttttttt 
gcggagatat 
attcggtttt 
gtggggtagg 
tttatcgttc 
attttgaggt 
gtaattgtag 
gtttttttta 
gtatgtgggg 
tagtaatatt 
gtatagcggg 
tatttggggg 
atttatgttt 
tttgtagtcg 
ttgtttgtta 



ggtttgcggt 
ttggtgttta 
gattgttggg 
gtgtttgtta 
aaattttttt 
ggggttatga 
ataaattttt 
atttttatat 
taagggataa 
ttcgggagtt 
tttttggttg 
aggtttttgc 
gcgggtttcg 
tacgcggtgt 
ttagcgcgtt 
gttcggttgg 
tatagggcgg 
gttgaggaga 
ttgggcgcgt 
tttatatttt 
tcgtgttttt 
ggtttttttg 
aagtttcgag 
gttttgtttt 
ttaggggtgt 
tttttatttg 
gtgtttgatt 
gatgtgggag 
atttattttg 
gtcgttttgg 
aaagttagga 
tttgtatttt 
taggttttgt • 
tttgtttttg 



gggatttagg 
tttaggggag 
tgtgtggttg 
aataaagatt 
aggttttttt 
gtgtttgaga 
ttttaagaag 
ataagtagtt 
attattttcg 
gttttcgtcg 
tttttttcgt 
gtacgacgtt- 
cgggttggaa 
ttcgcgcgat 
tagcgggtgt 
gttcgtgttt 
gtttcgattt 
gtcgcgtaat 
tttcgtagag 
atttcggacg 
ggaggttagt 
gttcggtttg 
tttttatttg 
ttattgcgga 
attttgggtg 
gttttgttat 
tggtattttt; 
tagggtttag 
gtatcgtgag 
agttattgtt 
aggtttgatt 
ttttattttt 
tttgaggggt 
ttttttcgtt 



ttagcgggga 
agtaattttt 
cgtgtatatt 
taataaaatt 
ttataggtgg 
agaggtaagt 
ggttaaggta 
atttttgttt 
cgtatttttt. 
attttttttg" 
tttcgtaggc 
ttagatgaag 
gcggtggtta 
gcgtagcgcg 
tagttttcgt 
cgttggtttt 
tagcgttttt 
ggaaatttgg 
tttttttcgt' 
gttataacga 
tttattgtgt 
ttgaagtaaf 
gggtgaggag 
gattggggtt 
gatttttttg ■ 
tttgggttgt 
tgtaggtagg 
aggtttaagg 
tagcgtttag 
tttagtatat 
ggttttggaa 
tttgttattt 
tgaatatatg 
ttagtcgttt . 



<210> 15 
<211> 11001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated .genomic DNA (Homo sapiens) 
<400> 15 



9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540' 
9600 
9660 
. 9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920. 
10980 
11001 



ttttagattt tagaaatttg ggagcggttg gagcgagaaa atagaggtaa gtggtaggta " 60 

attgttaagt attagtttta gtatgtgttt agttttttagi agtaggattt gcggttgtag 120 

gcgcgaaggt aaggtttgtg gaaatggtag ggagggtgga ggggatgtag gaggtatgga 180 

tgtgggtggg gtgtttttat tttttagggt tagttagatt tttttgattt ttttttaggt 240 

gggttgagat ttataggttg gatgtgttag aggtagtggt tttagagcgg tttcgttgtg • 300 

tttattgtag tgtagaggtt tttaagcgtt gtttacgatg ttagaatgag tggtattgtt 360 
gtaggtgagg gtattttaga attttggatt tttaagtttt atttttatat tttttatatg - 420 

tattgttatt tttaatattt atttgtttgt agggagtgtt aagttaagta ttgggaaaag 480 

.tatggaaaga tttgtgtttt ggtagtttag ggtgatagag ttaaatgagg gttgtagttg 540 ' 

ttgagggtcg attatttatg ttaagggaat ttatttagaa tgtatttttg aattttaaga 600 

ttacggttta gtttttgtcg gagttttagt " tttcgtagtg gagagtagag cgggcggtaa' 660 
agttgttgat cgattttttt tttttttatt ttaagtgaag gttcgagatt ttttgtttta ' 720 

tttagtgggt aggttaagtg tgttgtttta gtaaatcgga ttaggagggt tagggtcgga 780 

tgtggggatt tttttttttt agtatagtaa agttggtttt tagaaatacg ggt^ttttcg 840 

cgtggtgttt tgcggtcgtc gtcgttgtgg tcgttcgggg tggggtgtga ggaggggacg ' • 900 

aaggagggaa ggaagggtaa ggcggggggg gt tttgcgag agcgcgttta gtttcgtttt ' 960 

cgggttttat agtttttgta tttaggtttf tattgcgcgg ttttttttag ttttttttcg 1020 



• • • 



tcgtttagtt 
ggttcgcgtt 
ggagtttgag 
tattcggttg 
acggtagttg 
gtggtgcgat 
gcgtgagtag 
tcgtcgcggg 
cgttattttc 
. attcgtattt 
ttaagttttt 
tttcggagtt 
ttttagggtg 
ggtgggtttt 
atttagtgaa 
ttttgggaat 
ttaggggtgt 
aattgttggg 
tatggagttt 
gttttaggtt 
tagttggggg 
gtggtttatg 
taggagttta 
aaaaatgagt 
gggagaattg 
ttttagtttt 
taaggttggg 
aggatgattt 
ggtttttgtt 
gtaagtgtag 
cgttttttag 
tagtgtttta 
attgagttta 
gtaggagttg 
agtaagtttt 
ggtagtatat 
ttggttggag 
gttttggttt 
gtttagtttt 
tcgggtgcgg 
atttgagatt. 
aaatataaaa 
tgaggtagaa 
attgtatttt 
tgttaatgag 
ttttttgttt' 
gcgtagtagt 
ggtttttttt 
cgttttatcg 
gttcgtttgt 
ggatacgaac 
tatagtgtag 
cgatttttcg 
tgtcggtttc 
ttcgcgtacg 
tgcgtttgtt 
ggagtttcgg 
tttttgtttt ' 
gcggcgcggt 
tcggggcggt 
gggcgaggcg. 
ttgtagttaa 
agttcgtagg 



tggattttgg 
tgttagcgtt 
tttattgagt 
gagcgtgtta 
gtttttggtc 
ttttgtggcg 
tggtttcgcg 
ttaagtcgcg 
gttttatcgt 
gggttataga 
tatttttaaa 
tattttttta 
aagggaaatt 
ttgtaagtag 
ttagatgtta 
cgttagtaag 
ttttatagtt 
agtggatttt 
" gggttgagta 
tgtgtgtgta 
gagtataaag 
tttgtaatt-t . 
agattagtt't 
taggcgtagt 
tttgaatata 
ggcgatagag 
atattgtagc 
gggtatgttt 
tttagtaatt 
tagtgaggtt 
atattttagt 
gtcgtggatt 
gattttatcg 
tagggggtag 
aaatgatgtt 
agatcgttga 
tgggtagagg 
gaggggagta 
ttttttaagt 
tggtttaagt 
aggagtttga 
•attagttggg 
gaatcgtttg 
agtttgggtg 
gaaaggggaa 
gtggtttttc 
gtgaggtaat 
ttggggtttt 
ttaggttttt 
ttggattgtt 
gtggtgagcg 
gtattttcga 
gcggtataga 
gta'agtattt 
cgtatttttc 
cggacgttgg 
agcgatgagg 
aatgagataa 
tttggttcgg 
ggttggcggt 
gaggcgtttt 
gaggattcgg 
tcgcgtcggg 



gggaggcgtt 
taaagttagc 
tgcgggagtt 
acgcgttgcg 
gtggttatcg 
attttatttg 
cgtttacgag 
gtagaggggg 
tgaagagtgc 
tgagtagagg 
atgttttggt 
ggtttgtttt 
attatttatg 
gagttttgtt 
aaatatgta'c 
taaaggttgt 
tttttcgtta 
aggtattagt 
ttattgtttt 
tggagatttt 
ttgtgataga 
. tagtatttcg 
ggttaatatg 
ggcgggtgtt 
ggaggcggag 
tgagattttg" 
gtttttaaag 
attagaggta 
gggagtttag 
gaagaaaagg 
gaaggtattg 
tggggtagat 
tgagatatgg 
gaggaaaatt 
gagggggtga 
ggattttgtt 
ggagagtgaa 
gagataggtg 
aaatgggtag 
ttgtagtttt 
gattagtttg 
tatggtggcg 
aatttaggag 
ataaaagtaa 
gttttgtgtt 
ggtcggtttt 
ttattgagat 
tttttaattt 
ttgtagattg 
gcgggtttcg 
cggggtcgag 
gggttgtttg 
gaggttaatt 
tattcgttgt - 
gtattttttc 
tattcgtttt 
ttatttttgg 
gagttagatt 
gcgtacgcgt 
tacggacgcg 
ttttcgaaat 
attcggagtt 
attaggtggg 



gaagtcgggg 

gaagtacggg 

ggtattcgtt 

tatcgcgcgg 

tttttagttc 

gggcgtcgtg 

agcggaaggg 

tcggcgggga 

gcgaaaatgg 

tggttgttta 

tttttttgag 

tttttaaata 

ggagggagtt 

gagtttttat 

gtagttatat 

ttttttttgg 

gtttggattt 

aaattttaaa 

ggtttattta 

tttattcggt 

atgtaaagaa 

gaaggcggag 

gtgaaatttt 

tgtaatttta 

gttgtagtga 

-ttttaaaaaa 

agaaataaag 

gagataaggg 

tggttatttt 

gaattaaatt 

gtacggagga 

gaatgtagat 

aagttgagtt 

gggagagtgt 

gaatggagaa 

tcgggtagtt 

ggcgaaggat 

ataattatag 

atgtatttta 

agtattttgg 

gttaatatgg 

ggcgtttgta 

gcggaggttg 

gacgtagttt 

aggcgataga 

tagacgttta 

aggtcgggtt 

tcgggatat't 

taagtttatt 

ggatttgggt 

ggcgtatggg 

ggtgtcgcgc 

ttgcgcgggg 

aagcggattc 

gttttcgttt 

cgttttttgt 

gggcgaagcg 

tcggcgattt 

atattgatac 

taggattggg 

gatttggagt 

cgagtagtat 

agttaggggg 



ttcgttttgt 

tttaatcggg 

gggcgcgttg 

ggtatcgcgt 

gcggggttcg 

cgtaaaggtt 

gtagttaagg 

tagttttcga 

tttatttttt 

tatgtaaaaa 

aaagggtttg 

tttatgattt 

tggaaaaatt 

ttagtaaata 

atttagtagt 

gtagatatta 

tatcgtagat 

aattttttaa 

ggaatttgtg 

ataagaggat 

tgaataaggg 

gcgggtggat.' 

atgtttatta 

gttattcggg 

gtcgagatcg 

aaaaaaaaaa 

tagttatgga 

agaaattaaa 

tgttgtaaag 

ttggttatgt 

tttagtttga 

agattaggtt 

agaatttgta 

agtttttggg 

tggaatattg 

ttttggagga 

tagagtgtat 

ggtagacgta 

tatacgtttt 

aaggtcgagg 

•tgaaatttcg 

attttaggta 

cggtgagtcg 

tttgttgttg 

gatttaattg 

ggtggttaat 

tgcggagttt 

ttttcgattt 

tgttattatc 

tgggaattcg 

aagggcgagg 

gtaaggagcg 

gttgggaggg 

gggtttaggt 

tcggttagag 

ggtaggtggg 

tgcgtgtttt 

acgttttagt 

gcgtatacgt 

ggacgggcgg 

agtacgacga 

tttatcgcgc 

tgtcggcggg 



ggtttcgttc 

ttatgtcggg 

ggaagggtcg 

gtaattttat 

ttacgtatac 

tgtagtgcgc 

ggtagcgtag 

ggattaggtt 

gtcgtatttt 

tacgttgatt 

tgtttattgt 

tttttagaat 

tagaattttt 

tttttttttg 

ttttttgtat 

gttggaatta 

ttgttgaatt 

attattgtaa 

gatggatagt 

attataaatt 

gtcgagcgcg 

..tatttgaggt 

aaaaataaaa 

aggttgaggt 

tgttattgtt 

aaaaaaagaa 

gataagaagt 

gataagtttg 

aggaagttgg 

ttatttgaaa 

gggtttaggt 

agtgattagg 

aaggagttga 

agttaaaggg 

gattttattt 

agaggtaagt 

agagattagt 

ggttaaaggt 

tagtgaaggg 

cgggtggatt 

tttttattea 

tttaggaggt 

aaatcgcgtt 

tttttttaat 

ttgagtaggt 

attagagttc 

ggcgagtagc 

ggagtttttt 

gttgtcgcgc 

cggtggagcg 

atgggtaggt 

ttttaattgt 

gagtttggat • 

tgatttaggt 

gttatttttg 

gtttgtgagt 

cgtttcggcg 

tttaacggtt ■ 

acgtacgcga 

gtacggttat 

gtagtggtta 

gaatttcgtt 

cgggagggga' 



1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 ■ 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

414 0 

4200 

4260 

4320 

4380' 

4440 " 

4500 

4560 

4620 

4680 

4740 

4800 



- ~ AmJ " • • 
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agcggtcgtt 
tttatttttt 
gtgcgtaaga 
ttatagatag 
tattatttag 
ttcgaatgtt 
ttcgttagga 
ttttatttgt 
" ttattagttg 
gatttgggaa 
atttatagga 
aggttattaa 
gaattgtgtg 
taggaaaaag 
tttatttaag 
ttttataaaa 
gttgagttga 
tattgtggaa 
ggaggtgtat 
ttgggaggta 
gagtgagatt 
tgttaatgtt 
tttggttaat 
ggtgtgtgat 
gttgtagtga 
ttttaaaaaa 
ggaggttaag 
gtaaaatttc 
gtaattttag 
ttgtagtgag 
tttaaaaaaa 
ttaaaaaa'aa 
ttagtatttt 
gttaaaatgg 
aggtatttgt 
gtagagattg 
attttgtttt 
ttttatggtg 
gaaatttttt 
ttttggaagg 
gtttattatt 
tgggttattt 
ttattataag 
tttttttttt 
. tt'atttgggt 
gtttttagtt 
tttattgttt 
gagatttttg 
ttttttttgt 
gtagcgggtt 
tttttttatt 
gggagaattt 
atttttttgg 
tttaattagg 
gtttgggaga 
ggtaagaatt 
tcgggtacga 
atttgaggtt 
aaatataaaa 
gtttgaggta. 
gttattgtat 
ttgggttaaa 
tatttttggt ' 



ggagtttcgt 
tggttacgtg 
tttcgatttt 
gttgaatacg 
ttatgattaa 
tttgtttttt 
attgagaagt 
atttttgtat 
ttggaatgtg 
tagataggaa 
agggggaaag 
tttaggattg 
ttagatgtta 
aaatagggat 
taaagaaagt 
tgggtggtgt 
gtgtagattt 
aggtgggaaa 
atttgtaatt 
gaggttgtag 
ttgttttaaa 
ttgggaggtt 
atggtgaaat 
tgtagtttta 
gttaagatcg 
aagaaagaag 
gtaggtagat 
gtttgtattg 
ttattgggga 
tcgagattgg 
aaaaaaaaaa 
agaaaaaaat 
ggaaagtcga 
tgaaatttcg 
aattttagtt 
taatgagtta 
aaaaaaaaaa 
tagaagggaa 
tttagtaggt 
gttgtgtagt 
agaagggaag 
aggaagggaa 
aataagttag 
gttttggtat 
attttttttc 
tttatttttt 
ttagtaaggt 
gatatttggg 
aaggttttgg 
ttgggtaagg 
tgtttttatt 
tggttttttt 
gataaagtat 
gtttatttga 
agaaagtcgg 
tagtttttag 
tggtttacgt 
aggagtttaa 
attagtcggg 
ggagaatcgt 
tatagtttgg 
tttttagata 
agattggagt 



ttttttcggt 
gtttttcgcg 
atcgtttaga 
gaaaaagt'ag 
agtcgatcga 
ttttggggag 
atgtttagga 
ggagttagtt 
gaagattttg 
agaagttttg 
ttagattata 
taaggtgttt 
ttggtgtgat 
tgtgaagttt 
ggagttgtga 
ttagtacgtt 
ttgggtttgt 
attataagat 
ttagttattt 
tgagttaaga 
aatagaaaag 
aaggtaggtg 
tttattttta 
gttatttggg 
tgttattgta 
gttgggtttg 
tatttgaggt 
aaaatataaa 
ggttgaggta 
gttattgtat 
aaaagaaaga 
attagtttta 
ggtgggtgga 
tttcgattaa 
atttgggagg 
agatcgtatt 
ttattagttt 
attttatggt 
agttttggaa 
gtttttaatt 
gggttggaaa 
gaaggagtaa 
tattattttt 
tagagtttgt 
gttggtgtat 
agggtttata 
ttgtattggg 
gaagaaaatg 
ttttagtttt 
agtattttgt 
agatggttta 
gggttaggta 
tggttagcgt 
cgttaatagg 
tttagagttt 
atgaggtggg 
ttgtaatttt 
gattaatttg 
tgttgttgtg 
ttgaatttag 
gcgatagagt 
gtataggtag 
ttggtttaaa 



tcgttgtcgc 
ggttttggtc 
tgttgggtgt 
ttgtatggtt 
gtagtttgga 
attaggggag 
aaatttgaga 
gagtttttat 
tttttttttt 



ttttggatta 
aaatgtttaa 
cggaattttt 
tattaggttt 
agtatgtttg 
gtgatgtttt 
aaaattttag 
ttttatgtta 
attaattaat 
gggagggtga 
tcgtgttatt 
gaagttaagt 
gattatttgt 
ttaaatatat 
agattgaatt 
ttttaatttg 
gtgatttatg 
taagagttcg 
aaaattattt 
ggagtattat 
tttagtttgg 
ataggaggtt 
ggttaggtgt 
ttatgaggtt 
aaatataaaa 
ttgaagtaga 
attgtatttt 
ttatggatag 
tttgttgtgt 
atagaaaaag 
taagtttgtt 
tgaggttatt 
gtttttttat 
gtgttttttt 
ttggtatttt 
tgaataaata 
ttatttgttt 
gtttttattt 
agtttaaatt 
tagttttata 
taaaacgttt 
agtgtttaag 
taagattatt 
tga.gtttagt 
aagtaaggtt 
tggttgtttt 
gagtgagtgg 
agtattttgg 
gttaatgtgg 
gtacgcgttt 
gaggtagaat 
gaggtttcgt 
gtgtagaaat 
gttgtttttt 



gttttgggtc 
ggggatttgt 
. cggggttttt 
tgtggtagat 
ttagtatttc 
gatgtggaga 
ggtagagaga 
tttttttttg 
ttagggtgga 
taagtattta 
agaggtggaa 
taggtatttt 
agagaattag 
gtagaaatgg 
agataaaatt 
tttagagttt 
gttagttttg 
tgaaaaggag 
.ggtagaagga 
gtattttagt 
acggtggttt 
aattaggaat 
aaaaattagt 
attttaatcg 
ggtgataggg 
tttgtaattt 
agatttgtta 
ggttatggtg 
ttgaatttag 
atgagagagt 
gagaagtttt 
agtggtttat 
aggagtttaa 
aattagttag 
gaattgtttg 
agtttggaaa 
tggtagagtg 
attcgattgg 
aaattttttt 
ttttgagtga 
gtattttagt 
tgttaggtag 
tgttttgtaa' 
ttttgttttt 
tatttattgt 
ttattaattc 
tagtgttttt 
tttatttttt 
tttttgttgg 
tattttgttt 
gggattttag 
ttataggaaa 
tgtgtttgtg 
gatgtagtgg 
gttttatatt 
tcgagttaaa. 
gaggtgaagg 
tgaaatttta 
gtagttttag 
ttgtagtgag 
tttaaaaaaa 
ttattaggaa 
atgtagtttg 



ggtgggtagt 
tcgcggaatc 
ttggtttttg 
ttgagtcggg 
gattttcgcg 
gggaagagtt 
gattttgttt 
ttttggtttg 
tttggagaaa 
ggagtatttt 
aaagagatt't 
tattatcgga ' 
gtttaggtat 
ggcggaaatt 
ttataaaatt 
gggtgtaagg 
agttattttt 
ggttagttac 
ttatttgaat 
ttgagtgata 
atatttttaa 
tcgaggttag 
cgggtatggt 
ggaggtaaag 
tgaggttttg ' 
tagtattttg 
ggttaatata 
gtgtgtgttt 
aagatagagg 
' aagattttgt 
aagttatatg 
atttttaatt 
gattagtttg 
ttgtggtggt 
aatttaggag 
atagagcgag 
gagggtgggt 
gatggttgtt 
tt.ttttagaa 
agatagggag 
ttagggtttt 
gagtttagag 
ataaaatgat 
agtatttttt 
'tttatttata 
gataaggttg 
ttttatttag 
tttttttatt- 
ttgtagaata 
ttttatttgt 
ggcggagtta 
ttttgtggga 
atattcgtat - 
ggttaaggga 
ttattttttc 
aatttttggg 
taggcggatt 
tttttattaa 
ttattcggga 
ttaagattta 
aaaaaaattt 
gttgtttgtg 
ggttaaggtt 



4860 
' 4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 

• 5460 
5520 
5580 

' 5640 
5700 
5760 
5820 
5880- 
5940 
6000 ' 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 " 
6720 
6780 
6840 
6900 
6960 

7020 

7080 

7140 . 

7200 
■ 7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 . 

8280' 

8340 

8400 

8460 

8520 

8580 



aaatattatg 
tattattttg 
ttgtagtggg 
ataaattttt 
ttttattgat 
cgatggtacg 
tftagttttt 
tttttagtag 
ggtgatttat 
. tagtttgttt 
tgtagtgata 
• attttagttt 
tttttttttt. 
gaggattaag 
attatattta 
tttggttttt 
tttagtgttg 
atttatttag 
tatatgtata 
ttatttttta 
taggacgagt 
attaaggagt 
tttaggaagg 
tataattgag 
taagtagatt 
gaataaggac 
ttttgtgttt 
gggtataagt 
gtatgtgttg 
ggtggtggat 
tgggtttttt 
ggggttgagt 
agttgttgga 
ttttgagttt 
tagttttttg 
aggaattgtg 
ttaaagtaaa 
aagaattgtt 
tttatttaat 
aaggtattag 
attggggaat 



ttatagtgat 
tttattggtg 
ttatggtttc 
attttaattt 
tttttttttt 
attttagttt 
taagtagttg 
agacggggtt 
tcgtttcggt 
ttgttttttg 
cgataatagt 
tttgaatagt 
ttagtagaga 
tgattttttt 
gtttgatttt 
ggttatgtgt 
aggttttagg 
agtatgtaaa 
ttagggatag 
gagaagtttt 
ttgtggagtg 
ataatgttta 
.gggcgtgggg 
gttattttgg 
gatattttat 
ggtttttata 
tttagtaaga 
gttaggcgtc 
ttacgtataa 
gattttcgta 
attttatttt 
agttggtttt 
tgatgagtag 
tgttttgaag 
gttgttatta 
ttggaagata 
ggtttttaga 
ttggtgtttt 
gatatgagga 
gtttgtggag 
atttgatttg 



tttttttatt 
gttggattat 
gtgtcggggt 
aattagggtg 
tttgagatcg 
attgtaattt 
ggattatagg 
tttttatgtt 
tttttaaagt 
tgttttgaag 
ttattgtagt 
tgggattata 
tgaggttttg 
attttagttt 
attttaaatg 
ttttagttgt 
taggtagtag 
attaggagta 
ggggttaaag 
gggtttttta 
ggagatattt 
gattaatagt 
aggagtaggt 
gtgggcgtaa 
ttggaattta 
taggttttat 
agttattttt 
gtattttttt 
gggtacgtga 
agtttgtatt 
gttttatgtg 
gtttttgatt 
tttttgcggt 
gaaaatgatt 
gataggattg 
agttttgtaa 
ttatttatgt 
gatttttttt 
ggtttttttt 
tgaaattaga 

g 



atgtgtgaga 
tgatgtgtat 
agaggagaaa 
ttgtgattag 
agtttcgttt 
ttatttttcg 
tatgtgttat 
ggttaggttg 
gttagaatta 
tagggtttta 
tgtaattttt 
ggtatattat 
ttatgttgtt 
tttaaaatgt 
agagtttttt 
ttttgttttt 
agagaagtta 
gtggtgggtt 
gtagttagtt 
agggttttgt 
gatttttttt 
aattttttta 
atttagttat 
gtagtaattt 
ttattaaggt 
taaggtttag 
tttgtaggat 
ttatttgttt 
agttattgag 
ttttgagcgc 
agggtatata 
attatttttt 
tgcggttttt 
ttggggaggt 
atgggttgta 
aatagtttta 
taaagtttag 
ggtgtggaaa 
agatttttta 
tttagaatat 



tatggagaat 
tattttttat 
aatgggttgt 
aatgtgtaat 
ttgttgttta 
agtttgagta 
tacgtttggt 
gttttaaatt 
taggcgtgag 
tttagttttt 
cgggtttaaa 
tatatttggt 
taagttggtt 
tgggattgta 
tttagagttt 
gtggtatttt 
aatgattttg 
tagggtgggt 
tttaaagatt 
gtttatgttg 
aagttgagat 
tgagtttggt 
gtgtttgata 
gtgtatattt 
ttggtttttt 
ttgaagttgg 
gttcggcggg 
aaggatgttg 
gttttgttgc 
gttgagcgtt 
cgtatgtatt 
ttttatagtg 
ggtagggttt 
gaacgtgagt 
gttatagtaa 
ggagtgtata 
ggttgtttta 
attttttgga 
tttggaagtt 
gtttgatttg 



tggttttaag 
tttttttatt 
tttttttagg 
tgaggtgtga 
ggttggagtg 
atttttttgt 
taattttgta 
tttgatttta 
ttaacgtgtt 
taggttggag 
cgattttttt 
taattttttt 
ttaaattttt 
gatgtgagtt 
tttagttgtt 
taaggttata 
tttttttttt 
attagttatg 
gttttagagg 
gtttattttg 
tgagtagaag 
gagttgattg 
tttagagggt 
agtttagttt ■ 
tattttttta 
tgcgttttgt 
gtttaggacg 
ttaagtattt 
gaaagttttt 
acggttaagg 
tgagtatgta 
tatttgcgga 
agtgataagg 
atatagtttt 
ggtttggagg 
ggtattgtaa 
agaagttagg 
gatgtaggag 
ttttggtttt 
tttataggta 



8640 
8700 
8760 
8820 
8880 
8940 
9000- 
9060 
9120 
9180' 
9240* 
930.0 
9360 
9420 
9480 
9540 
9600 
9660 ■ 
9720 
9780 
9840 
9900 
9960 
10020 
. 10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11001 



<210> 16 
<211> 3501 
<212> DNA " 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo* sapiens) 
<400> 16 



gtggggttgg- 
attggttaag 
atgggttgtg 

aggggagttt 

ttgtgagttg 
tttttttaag 
gaggtgttgg 
gtttgggtta 
ttttttggga 
ttttgatata 
ttgttgtttt 



taggggtgtt 
agttgttatt 
gggttttgtg 
agggtttggt 
agggtttggg 
tttggggtgg 
gtttgtgtgt 

gggttgtagt 

gttgtgtggt 
atgttttttt 
tgttttttgt 



gttttggtat 
gttgtgggga 
ggttaggagg 
tttgggtttt 
gttggtgtat 
tggagttttg 
gtagggttta 
tggttgtgtg 
tttttttttt 
tgtgttgggt 
gtggttgttg 



agtttggg.gt 
ggggagtttg 
ggaatggggt 
gttgttggat 
tttttttgtt 
ggggagaagg 
ggttgaggtt 
ttgtgtttgt 
tttttatttg 
ttggtttttg 
tttgggtttt 



ttggtagtgg 

gtttgttggg 

tgggtgggtg 

ttgggggttg 

ttgtttgtag 

ggttggggga 

gggatgtggg 

ttggggtgtt 

ttttttgttt 

tgttttgttt 

gattgagggt - 



tgggtggggt 
attgtaggta 
agtagtgggt 
tgaggaagag 
ttggaaaatt 
gttgtggagg 
tggggtgtag 
gttttttttt 
tagttttttg 
gttatggtag 
tgatagtttt 



60 
.120 
180 
240 
300 
360 
420 
480' 
540 
600 
660 



<2X- 



tggttagggt 
tggggtaatt 
ggtaaagttt 
attgttttta 
gttaggtgtt 
ttttgttttt 
gttttttgtt 
gtggagggga 
ggttggtgtg 
tgtttggtta 
tgtgtgatgt 
tggttagtag 
atgatatggt 
tggttgatgg 
tgggtgttgt 
agaaatgttt 
gtggttatgt 
aggtttgtta 
.gtttagaggt 
ttgttgtatt 
agaagagttt 
tgtgtttgga 
tgttgttgtt 
tattttgtgg 
gtttttttta 
gttgggagtg 
tgtttggggg 
ggtttggtgt 
gtgaggattt 
atggtgagtt 
tttttagttt 
tgtatggtag 
ttggaggtgg 
tttattgttg 
agaagatgta 
agtgtgggat 
atgtgtgtgg 
tgggtgagaa 
ttattagtta 
tgggtttggg 
ttgagggtgt 
ataaggtggt 
tggtgttgga 
ataaggtggt 
ttattgattg 
gtggttttgg 
ggataattgt 
ttgttttgtg 



ggtgttaggg 
ttttttgagg 
tttgttttga 
aaagggttat 
ttggggtgtg 
ttttttggtt 
ggtgtttagg 
gaagtgttgg 
tgtttttgta 
ttttaggtag 
gattattgtg 
tgtttatttt 
gagtttggtt 
tgtagaggtg 
gttggttgtt 
taagtgttat 
gttttatggt 
tttgttftta 
tgtggttaat 
ttgtgttttg 
gttgggtttt 
tagttttttt 
gttgttgttg 
tggtagtggt 
ttgttggatg 
tggttttttt 
gtttttgttt 
gggtggtgat 
tagtttgttt 
tgagagtttt 
tgagtagttg 
ggttgaggtg 
tggggataag 
tgtgttgtgt 
ttggttgatt 
tatgatgtgt 
tatgtggttt 
gttttatgag 
tatgaagatg 
ggggtttttt 
ttttgttgtg 
tgtggttgag 
gagtttttat 
tgaggtgttg 
tttttttttt 
tttgtatttt 
agtgttgtta 
ttttagtttg 



tgggtattgt 
tgggaggtgt 
atgatttttt 
tgtgtttgaa 
taggttgttt 
tgggtgggtt 
ttgtagggtt 
gttggggtta 
ggagagtgtg 
ttgttgttgt 
gtgtagaatg 
aagtttttgg 
gtgttttgtt 
gttgtggttg 
gttagttatt 
ggtaagtatt 
tggttgggtt 
gttgggtttt 
atgtattgtg 
gagtgttgtt 
gtggtgttag 
agtgttggtt- 
tttttttaga 
agtttgggat 
aagtatgagt 
agtgagtgtt 
ggtttggtgt 
ggtgatgatt 
ggtggttatt 
ggtgataatt 
aatgtgtatg 
gttgaagtgg 
gttgttgggg 
gataagagtt 
tggttttatt 
tatatgtgta 
atgtgttagt 
tgttaggtgt 
tatgttgtgg 
ggtgtttttg 
gtttgtttta 
ttgttggtgt 
ttgttggtta 
agttagggtg 
atttagagtg 
agggagtggt 
tagtggtggt 

g 



gttttttttt 

tggttttttg 

ttgaagtgga 

tagttttttt 

tggttttttt 

ggtttgggtt 

gatgtttttg 

ggtggttagg 

ttgggtagat 

agtttaataa 

tttttttttg 

tggtgtatga 

tggtgttgga 

tggttgtggt 

tgtagatttt 

gttatttgtg 

ggggtttgtg 

tgttgttgtt 

ttgagttgta 

gttttttttt 

agtggttgtt 

ttgttgttta 

agttggagga 

ttgagttttt- 

tgggtttggg 

gtgaagagtg 

tgttgttgtg 

ataagagtag 

ttgagggtta 

tgtatgtgtg 

tggaggttta 

ttgttggggt 

ttttgggtgg 

ataaggattt 

tatgtattat 

gttatttggg 

attgttttat 

gtggtggtaa 

ggggtgtggt 

gttttgatgg 

tggttgagta 

agattatgta 

agtttatggt 

tttatttggt 

ttttttgtta 

gggggtggtg 

tttatttttt 



tttgtattat 
gttttttttt 
tattttattt 
tttggaagtt 
tttattggtg 
tttattttag 
tttagttgag 
gtgttgtatg 
gatgttggat 
ttagtgtatt 
tgtgtataag 
taatttgttt 
ttttatttat 
tttg[ggggtt 
tgattttgtg 
gggtggtggt 
ggttgttatg 
tgttgtggag 
tgtgttggga 
ttgtggtttg 
ggttgagtgt 
taaggagttg 
ggttgtattg 
tggttgtttt 
tagttatggt 
tggtggggat 
ttattttggt 
tagtgaggag 
tttatgtttg 
tattttgtgt 
tgtggaggag 
tgttggttta 
tttgggagag 
ggttatgttg 
ttgtgggaag 
ttttaagttt 
ggagtatatg 
gtttgtatag 
tggtgtggtt 
taagggtaag 
gttgagtttg 
ttttttgtat 
tgagttgggt 
ggttgggttt 
gtttgttttg 
tgtagggttt 
ggtggtttta 



tttttttaat 

tttttatttg- 

aaattgggta 

ttagtattta 

gttgtttatt 

agggtagttg 

ggaaggggaa 

gtttttattt 

atgatggagg- 

aagggttttt 

aatgtgttgg 

aatttggatt 

attggttgtt 

gagttgagtt 

gtgttgtgta 

ggtggtggtg 

ttggttattt 

ttgtttttgg 

tttggtttgg 

gatttgttta 

gagttgtttt 

ttttttgttt 

ttttttgatt 

gatgggttta 

gatgagttgg 

gtggttgttt 

agtttggatg 

attggtagta 

tatttggttt 

ggtaagggtt 

gaggaagtgt 

gggttttttt 

ttgttgtggt 

tggtagtatg 

aagtttatgt 

tttgtgtgtg 

tgtatttatt 

taatgtaatt 

ggggtgttgg 

tttgattttt 

aagtagtagg 

gattttaagg 

tttagttttg 

gatggttgga 

ttgttgttgt 

attgtgtttg 

tttggtttta- 



720 
780 
840 
900 
960- 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
'2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3501 



<210> 17 
<211> 3501 
<212> DNA 

<213> Artificial Sequence - 
.<220> 

<223> chemically treated genomic DNA (Homo ' sapiens) 



<400> 17 



ttgagttaag gtatgaagta gtgaggttag gtgaggttgt tgagaggtgg agttgttatt 

gtggtgatgt tgtggttgtt ttgggtatag tgggttttgt gtgttgtttt tgttgttttt 

tggggtgtgg gttagggttg tgtagtagtg atagagtggg ttggtgaggg gtgttttagg 

tgggagagaa atggttgatg gtttggttgt tgggtttggt tgttaggtga gtgttttggt 



60 
120 
180 
240 . 



^3- 



ttagtatttt ggttgttttg ttggggttga ggtttagttt ggttgtgaat ttggttagtg 300 

ggtagaggtt ttttagtgtt attttggggt tgtgtaggaa gtgtgtggtt tgtgttagta 360 

gtttggttgt ggttgttttg ttttgttgtt ttaggtttag ttgtttggtt gtgaggtgag 420 

ttatagtaaa gatgtttttg gggaagttga gtttgttttt gttgttgggg ttggggatgt 480 

tggggagttt ttttaagttt gttagtgttt . tggttgtgtt ggttgtgttt tttatggtgt 540 

gtatttttat gtggttgatg aggttgtgtt gttgtgtgaa tttgttgttg tatatttggt . 600 

atttgtaggg ttttttgttt gagtggatgt gtatgtgttt tgtgaggtgg tattggtgtg 660 

tgaattgtat gttgtatgtg ttgtatgtga agggtttgag gtttaggtgg ttgtgtatgt 720 

ggtgtgttat ggttttatgt tgtgtgaatt tttttttgta gatggtgtat gggtagggtt 780 

gggttagtta gtgtgttttt ttgtgttgtt gtagtgtggt tgggtttttg tagtttttgt 840 

tgtatgatgt gtagtggtag ggttgtagta gtttttttag gttatttgga gttttggtga 900 

ttttgttttt gttgttttta -aaagggggtt ttaggttggt ggttttagtg gttattttgg 960 

ttgttttggt tttgttgtat agtgtttttt ttttttttat gtgagttttt atgtgtgtgt 1020 

ttagttgttt agagttgggg aagtttttgt tgtatggaat gtatatgtat aggttgttat 1080 

tgaagttttt gggtttgtta taggttaggt gtgggtatgg gtagtttttg aggtggttgt 1140 

taggtgggtt ggggtttttg ttgttattgg tttttttgtt gttgtttttg tagttgttgt 1200 

tgttgttgtt tgtgttgggt ttgtttaggt tgttagggta gtgtggtggt ggtgttaggt 12 60 

tgagtggggg ttttttgggt gagatggttg tgtttttatt atgttttttg tagtgtttgt ' 1320 

tgggggagtt gtgtttttgg tttagtttgt tgttatagtt atttaggttt ggtttgtgtt 1380 

ttatttagtg atagaggaga ttaggtttgt tggggtggtt ggggggtttg ggttttgggt ' - 1440 

tgttgttgtt gttgtgaaat gggttggaag gtggtgtggt tttttttagt ttttggaagg 1500 

gtagtggtgg .tagtgatggt agggtgagag gtggtttttt gtaggtggtg. gggttggtgt 1560 

tgggagggtt gtttgggtgt gggggtagtt tgtgtttagt tagtggttgt tttggtgttg 1620 

tggagtttgg tgggtttttt ttggataggt ttaggttata aagaggggag tagtggtgtt 1680* 

ttgaggtata gagtgtggtg gttgggttgg gttttgatgt gtatagtttg' gtgtagtgtg 1740 

tgttgattgt ggtttttggg tttgagggtg gttttgtggt aggtggtggt ggaggtttga 1800 

ttggggatgg gtagtaggtt tggatgattg gtgtggtggt ttgtaggttt tggtttggtt I8 60 

gattataggg tgtgtagttg ttgttgttgt tgttgttgtt ttgtaggtgg tagtatttgt 1920 

tgtggtgttt gaggtgtttt ttgtatagtg ttatgaggtt ggggatttgt aggtagttgg 1980 

tggtggttag tatggtgttt aggtttggtt tagtttttgg ggttatggtt gtggttgtag 2040 

ttgtttttgt gttgttagtt aggtggttgg tgtagatgaa gtttagtatt aggtggaata 2100 

tggttgggtt tattatgtta tggtttaggt tgagtaggtt gttatgtatt attagggatt . 2160' 

tgaggtaggt gttgttggtt gttagtatgt ttttgtgtgt' gtggaagagg gtgttttgta 2220 

ttatgatgat tatgttgtat aagaagtttt tggtgtgttg gttgttgagt tgtagtagta 2280 

gttgtttgga gtggttgggt gtttttattg tgtttagtat tgtttgttta gtatattttt 2340 

ttgtggggat atatattggt tgggtgagag' ttgtgtggtg ttttggttgt ttggttttag 24 00 

tttggtattt ttttttttta tttttttttt ttttagttga gtgggggtat tagttttgtg 24 60 

gtttgggtat tggtgaagga ttggttgttt tttggagtgg gagtttaggt tggtttgt'tt 2520 

ggattaggag aaggagtagg aggtgagtgg ttgttggtgg aggggaggtt agggtggttt- 2580 

gtatgtttta gggtatttgg ttgggtgttg "gggtttttga gaagaaaatt gtttaggtgt 2640 

agtgattttt ttggagatag ttatttgatt taagtaaaat gtttgtttta ggaaaagtta- • 2700 

tttagggtgg a'gaattttat ttaagtaggg agaaagggag ttgaggaatt agtgtttttt 27 60 

gttttgggag aagttgtttt agttggggga agtgatatgg aggaggggag tgtggtgttt 2820 

gttttggtgt tgttttggtt gggggttgtt aatttttggt tggggtttgg gtggtggttg . 28 80 

tgtggggagt ggaggtagtg gttgttgtgg tgggtagagt gtgaaggttg ggtttggtgt 2940 

ggggagggtg ttatattggg gtaggaggtt gaggtaggaa gtaggtgggg gggagggggg ■ ,3000 

agttatgtag tttttagggg agggaggggg tagtgttttg ggtgggtatg gtgtatagtt ;3060 

ggttgtggtt ttgatttggg tttgtgtttt atttgtgttt tggttttggt ttgggtttta 3120. 

tatgtgtggg tttggtgttt ttttttgtgg tttttttggt tttttttttt ttggaatttt 3180 

gttgttttaa atttggggaa aagtttttta attgtagata gggtgggagg agtgtgttgg 324 0 

ttttaggttt ttggtttgta gttttttttt gtggttttta aatttggtgg tagagtttgg . 3300 

agttgagttt tgagtttttt tgtttgttgt ttgtttgttt gattttgttt tttttttggt 3360 

ttgtggggtt ttgtggtttg ttatttgtgg ttttggtggg ttgggttttt ttttttgtgg 3420 

tggtggtagt- ttttagttga tgttttattt gttgttgtta ggttttgagt tgtgttaggg 3480 

tagtgttttt gttagttttg t ' 3501 

<210> 18 • 
<211> 2501 
*<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



12?.- 



<400> 18 



tttttatagt 

tagtataaaa 

tttttttttt 

gggttgttat 

ggatagggtt 

ggttgtattg 

atttaggaga 

agattattta 

aagtagtggg 

gtagggatta 

gttatttttg 

aaatttagaa 

gtgttgagtt 

aaatggtttt 

ggattttaag 

gatttttgtt 

gttttgtttg 

tgatgggtgg 

ttgtgttttt 

gtagttgagg 

attttttgag 

tttttagtat 

ttgttgattg 

tatttttttt 

tgtgaggggt 

ttatttggtt 

tttttttaaa 

gttataattt 

ataggggtta 

ttggagagta 

gtggtaaaat 

aataaattat 

agtgggtgat 

ttattggtag 

taaatgttat 

taaagtggtt 

attttatatt 

ttttttaaat 

attttataat 

gaggtttagg 

ggggttatat 

tttatatatg 



gtaaatgtgt 
tttaaatatt 
taaatttagt 
tttaggtttt 
tgtttatttt 
ttattgaaag 
tgtttagtaa 
ggaatatttg 
ggtttaggat 
ttatattgat 
gaagtttttt 
aatgtttatt 
aattattttt 
tggaaggtaa 
ggagttgtgt 
ttgtttttgt 
tttgtgtgtg 
gtggtattgg 
tgttgaggtg 
ttattttgtt 
gaggaaagag 
tttggtgtgg 
taaaattggg 
gtaattaatt 
gggggttgga 
gtttagtgtt 
gagggtgttt 
gtttttgaaa 
ggtgtatgat 
gagaattttg 
ggaaaattgt 
ttatattaat 
ttttgtaatt 
tattttatat 
aaattggaat 
ttttatgttg 
tatagttgtt 
gtttttagga- 
gtttggtgtt 
gattatattt 
ttgaaaatta 
gtatgtagta 



ttttattatt 
taaatatgat 
tagtgtttgt 
ggtttttttt 
gatattatta 
tttattattt 
atgttgattg 
ttaaaatgtt 
tttgtatttt 
aaatttttat 
aaggataggg 
gagtgaatga 
ttatataagt 
gttttttagt 
gttttgtgtt 
tattttttgt 
tgttgttgta 
ttattggtgt 
tagttggttg 
gaggtggtgg 
ttggaaatat 
gttttttaag 
tgaaagtttt 
tttttagatt 
atttgttttt 
tgtttttttt 
gataagagtt 
tttattggag 
ggagaaagaa 
aagtggattg 
tatgtgaaat 
atagatgttg 
tttaagtgta 
ttggaaaggt 
tttgttaatt 
ttttttagaa 
tggttatttt 
aaatattttg- 
ttaaattttt 
taagatgttt 
tttaaggata 
tattttatat 



ttttggagta 

tttgttggtt 

gttgtttgta 

tagaaagttt 

gtttatttat 

gattttgagt 

aattatgatt 

tttgtttttt 

gatagttttt 

tatttttaag 

gttgtatgtt 

tgaaatgata 

tttagtagtt 

gagtggagtt 

tattttttta 

ttttttttgt 

ggttttattt 

gggggtaggt 

ttttttttgg 

ggaggggaat 

ttggtttttt 

gttttgtttg 

ttttgatttt 

tttgtttagt 

tgaagtgtag 

tgttgtaaga 

tttttttgtt 

ttttggtaga- 

aatgggagta 

gttagtggga 

agttttattt 

tgtagtgaga 

gggattttga 

tttaatgtat 

aatttttatg 

aatgttttta 

tgtatgggta 

tttatatttt 

tttgatagtt 

ttgttattgt 

gggattgttt 

tagtatttag 



attttattta 

ttgtttttgt 

atgttttttt 

tttttttttt . 

atatattggt 

agtttgagga 

gtttttaata 

gagattttat 

tgatgtttgt 

ggggattttt 

gtttttttag 

ggtgaataga 

tttattgttt 

atttttaatt 

ttagaaattg 

tttgtagttg 

atgggtttat . 

gagtatgtga 

gttggtatat 

ggttattttt 

aagtaggtat 

agaggagagg 

tatttgtggg 

tttttaaagg 

agatgttagt 

ttttaagttt 

ggagtttgta 

ggttgtaagt 

ggatgggtat 

aagttgtttg 

tttaaagtat 

ttttatatta 

tattatgtat 

gaattatttg 

tatttttata 

gtgttgatga 

tgtatttggg 

gtttttattg 

tttggataat 

taaggagatt 

tttttgatat 

t 



aaattgtttt 

ggttttattt 

ttttttgtag 

tttttagtgg ' 

tataagttta 

ttttattaaa 

tataaatgta 

tttgggaggt 

atgtagaagt 

ttttttaggg 

gttagtaatt 

tgaatgtaag 

ttagttgtag 

atatttttta 

gtaagttatt 

gtgtttagtg 

tgttgaggtt 

aggttggagg 

gtgtgtagtt 

gaggtattgt 

agtttgtttt 

ttaggttggg 

tattgattgt 

attgaaaagt 

ttttgaaaag 

gtgagaggat 

tgtttagtaa 

ttaaatgtgt 

atttgaggaa 

tattttagga 

aaaaaattaa 

gttttttatt 

ttttgatttt 

agttatatat 

tttgtattga 

atagttaagt 

tgtagttata 

taaataatgt 

ttttatgtag 

ttttttttta 

tattagtata 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
■1140 
1200 
1260 
1320 
1380 
1-4 4 0 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280. 
2340 
2400 
2460 
2501 



<210> 19 
<211> 2501 
<212> DNA 

<213> Artificial Sequence 



<220> 



<223> chemically treated genomic DNA .(Homo sapiens) 



<400> 19 ■ 

attgagtatt 
ga-agtagttt 
agtgatggta 
gagttgttaa 
taaagtgtaa 
atgtttatgt 
ttgaaagtat 



ggtgtaaaat 
ttatttttga 
aaaatgtttt 
aaaagattta 
ataaaatgtt 
agagataatt 
tttttgaaag 



gtattgtata 
atgattttta 
agaatgtaat 
aaatattaag 
tttttaaaaa 
agatagttat 
gtagtataag 



ttatgtgtaa 
gatatagttt 
ttttggattt 
tgttgtaaaa 
tatttagaag 
gggtataaaa 
aagttatttt 



gtatgttaat 
ttgaaaagga 
tttgtatgaa 
tatattattt 
gtatgattat 
tatttggtta 
attaatatag 



ggtgttaaaa 

aagttttttt 

aattatttaa 

ataataaaag 

atttaaatat. 

tttattaata 

atatgaaagt 



60 
120 
180 
240' 
300 
360 
420 



-28* : • 

• • • ♦ • 



atataggaat taattgatag aattttagtt tgtaatgttt aatatataat ttaaataatt 480 

tatgtattag ggttttttta ggtataaggt attattagtg gagaattaaa ggtgtataat . 540 

gttaagattt ttgtatttgg aggttataga ggttatttat tggtgagaaa' ttaatgtaaa . 600 

attttattgt atagtattta tgttggtatg aatggtttgt tttaattttt tgtattttaa 660 

aaaatggggt tattttatat aataattttt tattttgttg tttttgaaat ataggtaatt 720 

tttttattgg ttggtttatt ttggaatttt ttgtttttta gttttttaga tgtgtttatt 780 

ttatttttat tttttttttt tattatatgt ttgatttttg tgtgtatttg agtttataat 840 

ttttgttaag attttagtgg attttgagaa tagattgtga tttgttaagt atataaattt 900 

-taatggggaa gggtttttat tagatgtttt ttttaaagaa ggttttttta tgaatttaaa 960 

attttatgat agagggaaat aaatattgaa tgattgaatg attttttagg agttgatatt . 1020 

tttgtgtttt agggggtgaa ttttagtttt tgttttttgt ggttttttag ttttttaaaa 1080- 

gattaggtaa agatttaaga gagttaattg taggaagagt aataattgat gtttatagat 1140' 

aagggttagg gagaattttt atttagtttt gtaattagta gtttagtttg gttttttttt' 1200 

ttaggtagga, ttttgggaag tttatattgg ggtgttgggg agaagtgggt tgtatttgtt 1260 

tgagagatta .ggtgtttttg gttttttttt ttttaagaga tgtggtgttt taagaataat 1320 

tatttttttt tttattattt tagtggggtg attt'tagttg tggttgtgtg tgtatgttgg ■ 1380 

tttgaaaaga gtagttagtt gtgttttagt aag'gggtgtg gtttttaatt tttgtatgtt 1440 

tatttgtttt- tgtgttggtg attagtatta tttgtttgtt gaattttagt ggtgagttta 1500 

tgaataaggt ttgtaatgat atatatatga ataagtagag ttgttggatg tt'gattgtgg - 1560 

gataggagga ggtggggaat ggtgggggtg ggatgagggt tagtgatttg ttgatttttg 1620 

gtaggaagat gagtgtagag tgtgtggttt ttttggaatt ttgggaaatg tagttaagag 1680 

tgattttatt tgttggaaga tttgtttttt aggggttatt tttgtggttg gaagtaatgg 1740 

gagttgttag gatttgtgta gaagaatagt taatttgata ttttgtgttt atttatttat * • 1800 

ttgttgtttt at'tatttatt taataaatgt tttttgggtt tagttgttga tttagagaaa 1860 

tagtatgtgg tttttatttt tgaggggttt ttagagatag tttttgggaa ggaaagtttt ■ 1920 

ttttagggat ggtaaagatt tgttagtgta ataatttttg tatttttata tgtaaatatt 1980 

aggggattgt ' taaaatgtag .agttttggat. ttttattgtt tattttttaa aatagaattt 2040 

taaggggtaa aaatattttg ataagtgttt ttaaatgatt ttgtgtttgt atgttgagaa 2100 

tagttatagt ttaattaata tttattgagt attttttgag ttttgatagg atttttaagt . 2160 

tatttagagt tagatggtaa atttttaata atggtgtagt ttagatttgt gattaatgtg 2220 

tgtaagtgag ttaatggtgt taaaataaat agattttatt tttgttgggg agaaagagga 2280 

aaaatttttt gaaggaggat taggatttaa agtggtgatt tttgtaaaga aagaagggta '234 0 

ttataggtag tataaatatt agttaggttt ggggagaaag agggtaaagt tataaaagta 2400 

aagttagtaa gattatgttt agatgtttga attttgtgtt gaaaatggtt ttaagtagga" 24 60 

ttattttaga gagtggtggg aatatattta tattatggaa a- 2501 

<210> 20. 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 20 



aaagatgatt aaaagtttaa ttgtttattt gaagagttga tttttttatt tttgtaataa 60 

agggtatttt tagtagtttt tgtttatttt gtttatttgg ttttttttgt ggttgtgtaa 120 

ggttataatt. tttgtgtttt agtaaatttg tgtatgttta tttttttttt tgttattatt 180 

ttttttttta ttttgtttta ttattttgat gtaaaattat ttgttaattt tatttgaaat 240- 

gagaaatttt aaggtttata ttatttaaat tttgttagat' ttttattttt gttatatggt 300 

ttataatgtg ttgggtattt ttagatttgt ttattaaaaa gatgtaaaat aaaataatga ' 360 

ttatttttgt ggattttttt tttatttttg agatgttttt tttggttgta ttattttttt - 420 

attttttgtt tattgattag aggaggggtt ttaattatgg gtgaatttta tattttattg 480 

aagaggttat gttatatgta tatttttata atataattta tatttatata gtatttttat ' 540 

ttttagtata ttttttttta ttaattttaa taatattatt gtaagttatg ttgaagtaga 600 ' 

ttgtaagtgt ttatttataa attgtgaaat - gaattaaaat gaaagggtaa agattaaatt 660 

atgattaggt ttgaaattaa tatataagat ttaatttttt ttaattaaag atttttgtag "720 

gtgatttttg tttgtaggat tttttttttt ttttagatgt tattggattg' tattaggttt .780' 

attgtagatt ttagttgttg * tagaattaat tagatttaag atgagttttt " tgattttttt 840 

tggtagagtt ttttaattgt tgaattttaa tattgttgtg attagttagt gt'tataattt 900 

gtttgtttta ttttgtgtaa tggattttat attatagagg tattttttta atgttaagat* 960 
gtttaagtat tgtttaagtg taaattattt aatatttttt agttattaag taattaagat . 1020 



-I29C- 

• • •*« 



aggtaggatt 

tttgaatttt 

tttttttatt 

taattgttag 

tttttaggta 

agtataggtt 

tttatttatt 

tttattttgt 

tg'gaggatgt 

ttattaattt 

agtattatta 

aatgggtaaa 

tttgtgagaa 

t'ttattttga 

tgttttgaga 

gtttttttta 

aaaatttttt 

ttagttgtta 

agaagataat 

aagattgtga 

tgaggtaaga 

ttttgtaaaa 

atataaagaa 

taaagagaag 

tagaaattag 



ttatttgttt 
atttggttaa 
tggttatttt 
attttgtttt 
ttttttggtt 
attatagtta 
atttttttat 
taaatgattg 
gtaagattag 
attttggttt 
atataaataa 
atatttttaa 
ggtaaatgta 
attaagattt 
ttaaaaatta 
tttttttttt 
ttaattttat 
aagtaagttg 
tttaattatt 
tttttaattg 
•tattttattt 
gttagtattt 
aataagtgta 
agattaaata 



taaaatgatt 

ttttaatata 

ttgtattttt 

attttttttt 

ttagtttgag 

attttttgaa 

taaaaaggtt 

ttttttaatg 

ttttttatta 

taaaattttg 

atttttatta 

aattttgaag 

tttagatata 

tattgtttta 

gtatattaat 

taaaaataag 

aggtttaagg 

aggtattgaa 

ttataaaatt 

tagtttattt 

aaaagtgtat 

atatttttaa 

aaattttaat 

tagtattatt 



tgatttaaat 

atttttagta 

tttgtttttt 

gatagttttt 

tatagtagat 

taaatgtgat 

taaagttttt 

atattttaga 

aataaaaagt 

tgattatt'ag 

taattatatt 

aaattattat 

attaaagttt 

tttttttaga 

a'taatttttt 

tagttattaa 

gtagttaagg 

gatggagaat 

gga'aattgag 

atttttattt 

tttaaattaa 

ataagatata 

ggttagttag 

gtattgaggt 



taaaaagaga 
ttttataatt 
tttttttttt 
atttttaagg 
tttaagatta 
tgaattttat 
atttaatgtt 
attttagaat 
gtgaaatgga 
ataaaattta 
ttttaagttt 
ttgatagaaa 
ttttttttat 
tgttgttatt 
ttaaatgttt 
attttttagt 
atggttgtag 
ttaaattttt 
gtatttaata 
agtatttttt 
gtaataatat 
ttgaatttat 
"tttttagttt 
aaggtttttt 



atgtggattt 

ttttttaaag 

ttagttataa * 

ttatttattt 

tatatgttat 

gttagtaatt 

tttttttatg 

tattttatta 

atatgtaatt 

gaaataaaat 

tgtttgtaag 

gtttaattta 

tttaatttta 

tgaataatat 

taagagtttt 

agtgaatttt 

ttttatatga 

gataagagtt 

tgaaggtatt 

tttgtaaatt' 

gtaaattttt 

ttagtgaatt 

tttttaagat 

gtgtagttta 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560. 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
' 2280 
2340 
2400 , 
■2460 
2470 



<210> 21 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 



<220> 



<223> chemically treated genomic DNA (Homo* sapiens) 



<400> 21 

ttagttttta 

ttttttttta 

ttttttatat 

ttttgtaaga 

ttttatttta 

ttatagtttt 

attatttttt. 

taataattga 

aaaggatttt 

taaaggaaat 

ttttaaaata 

ttaagataaa 

tttttataga 

tttatttatt 

tagtaatatt 

ggattaatga 

atatttttta 

ataaaatgga 

ggtaaataag 

agtttatgtt 

tatttaaaga 

ttggtagtta 

ggtaagagga 

aagatttagg 

aattttgttt 

atatttaaat 

taaggtagat 



tgaattatat 
attttaaaaa ■ 
gatttattga 
aagagtttat 
agtttataag 
gatattttta 
agtttttgtt 
ttatatgaag 
gaaatttatt 
aaaattttta 
atattgttta 
tgaaattaaa- 
tggattaaat 
tttataggta 
attttatttt 
gattatatat 
tggtgaaata 
' tatgaaggag- 
aattattaat 
atggtatatg 
gaatggatga 
ttatgattgg 
ttttaggaag 
agatttatat 
attttaatta 
attttgatat 
aggttgtaat 



agaaaatttt 

gaattaagaa 

atgaatttaa 

atattattgt 

agggaatatt 

tattagatgt 

gaagatttga 

ttgtagttat 

attaaaaaat 

agatgtttaa 

agtagtagta 

atagaagaga 

tttttattaa 

aaatttagga 

tgaattttat 

tttattttat 

gttttgaagt 

agtattaaat 

ataaagttta 

tggttttggg 

ttttaggagt 

gaaagaagag 

aattatagaa 

ttttttttta 

tttaatagtt 

taaaaaaatg 

■attggttagt 



gttttagtat 

ttaattagtt 

tatattttat 

ttaatttaaa 

gaataaaaat 

tttagttttt 

gttttttatt 

ttttggttat 

ttagtggttg 

gaagaattgt 

tttaagagga 

gaattttagt 

gtaataatt't 

aatataatta 

ttgatagtta 

attttttatt 

tttaagatgt 

gaagatttta 

attatattta 

atttgttgtg 

agagattgtt 

gagagataaa 

tgttaggagt 

gtttaggtta 

aaaaagtatt 

tttttgtaat 

tatgataata 



agtgatgtta 

attggagttt 

ttaaaaatat 

atatattttt 

ggataaatta 

agttttgtaai 

tttagtgttt 

ttttggattt 

tttgttttta 

gttaatatgt 

tgaaatagtg 

t'gtgtttgaa 

ttttaaggtt 

tgataaaaat 

tagaatttta 

tgataaaagg 

tattaaaagg 

agtttttttg 

tttaaaaggt 

tttaaattga 

aaagagaaat 

gaagatataa 

tatattaaga 

aattattttg 

aagtagtttg 

atgaaattta 

ttggagttta 



tatttggttt 
tatatttatt' 
aaatgttaat 
aagtaaagtg 
taattaaaag 
gatgattgga 
taatttgttt. 
ataaggttaa 
aagaaagggg 
tagtttttgg 
gagtttta'gt 
tatatttgtt 
ttaaagatat 
ttatttatat . 
gagttagaat 
ttaattttat 
taattattta 
ataggaagat 
tgattataat 
ggttaaaaga 
gaagtagagt 
aagatagtta 
ttaattaagt 
'gaataaatjaa 
tatttaagta 
' ttatataaaa 
gtaattggaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780" 

840. 

900 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380* 
1440 
1500 
1560 
1620 



*30- > 



• 



gattttatta 
aatttatagt 
tagggattat 
gtttggttat 
atatttataa 
tatgttaaaa 
ataatttttt 
ggtaaggggt 
ataggagtga 
tatattataa 
aaaatttttt 
taagagaaaa 
agttataatt 
aaagtatttt 
aattattttt 



aaggaaatta 
aaatttggta 
ttataaaagt 
gatttaattt 
tttgttttaa 
ataaaagtat 
tagtaagata 
gaagaagtaa 
ttattatttt 
attatatgat 
attttaaata 
ggtagtaata 
ttatataatt 
ttattatagg 



ggggatttat 
taatttaatg 
ttttggttga 
ttgttttttt 
tataatttat 
tatgtaaatg 
tagggtttat 
tgtagttaaa 
gttttatatt 
agaggtaggg 
aaattaatag 
gagaaaaaaa 
ataaaaagag 
aataaaaaaa 



tttagattta 
atatttgagg 
aaaaaattga 
attttaattt 
agtgatatta 
taagttatat 
ttatagttaa 
ggagatattt 
tttttaataa 
atttgataga 
gtaattttat 
tgggtatgta 
. ttggatgggt 
ttaatttttt 



gttagtttta 
aggaagggga 
gttttgtgtg 
attttataat 
ttaggattaa 
tatgaaaata 
gatttttttt 
taaaaataaa 
gtaggtttga 
atttgaataa 
attaaaataa 
taagtttatt 
aagatgagta 
agatgaataa 



taatggttag 

gttttgtagg" 

ttaattttag 

ttgtaaatga 

taaaaaaagg 

tatatgtaat 

ttgattaatg 

ggaaaaattt 

aaatatttag 

tgtgaatttt 

taaaataaaa 

gagatataga 

gagattgtta 

ttaaattttt 



1680 

1740 

1800* 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2470 




<210> 22 . 
<211> 7001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 22. 

aatgtaatgg 
gtgtggaagg 
ttggagagat 
gaatggtgat 
ttggagata't 
tagtaagtta 
gagttgggag 
gtatattttt 
aagagaatgt 
ttttgattgt 
tttttatgtt 
tttttttttt 
tggttattgt 
tttaatattt. 
gtgtttaaat 
ggtattatta 
tataaatttt 
gtgattagag 
gggtgttatg 
gttagtatta 
gttattggga 
attagtttaa 
tatgtatttt 
gggtttggtt 
ttaaaatatt 
gtttagagat 
tttattgtta 
atttatatat 
tgtttttttt 
gtaggggtaa 
ttgttttttg. 
tgtgtagtag 
.tgttttttga 
ttatatattg 
tttt'tgattt 
• aggtttgttt 
tggttagtgt 



aaaaagagag 
taagggaaaa 
agtgatgttg 
ttatatttga 
atttaatgga 
tggttggaat 
tttgagtgaa 
ttatttatgt 
tggagagaaa 
aagtttttga 
aatagtgaat 
tttttttttg 
tgatttatag 
atttatattt 
attgtaatat 
atatttttat 
ttagtaattg 
ttatttaata 
tgttttggga 
tggttgagag 
aatgagagat 
aataaaggta 
ttatatgttt 
aatgaagtgg 
tattttaaag 
tgggggttag 
aattatattt 
ttttgattgg 
tttgtttggg 
aggggttggg 
tgagtagata 
tttagttgtg 
. atttttgtta 
agttatttgt 
tta'taagttt 
tgattttttt 
ttgtatttta 



attgtaaagt 

ttagtttttg 

ggtatagata 

ataagtttgt 

ggagttagat 

gtgtgtgttt 

gtttttgtta 

ttattaagtt 

gtggttaaga 

ataattaaag 

aattgtagat 

aatatttttg 

gaggtattat 

agtatttatt 

tgggggtttt 

ggttttttat 

tatagtgttt 

tttaaggtgg 

ttttgagtgt 

aatagttgag 

tttgttttta 

tttattttta 

atagaaagta 

aggagtatta 

tttggttttt 

ggttagtaag 

ttttggaaaa 

tattttattt 

gtggtagtag 

gttttttttt 

gtaagttttt 

tgtttgttgg 

ttttatgtat 

atatgtgagt 

atggaatatt 

ttttgtagtt 

gtagggagat 



tagaaggttt 

aagaagatag 

tggggaagtt 

aatgtttagt 

taatttttat' 

agtaggagag 

aaggtagaag 

tagggataag 

aaattgtttt 

tttgtgagga 

tttttttttt 

tttttttttg 

tgttattaat 

tttaatattg 

ttaaagtata 

tttttttttg 

tagggttaga 

tgattaatgt 

gggtatttta 

aaagtggtta 

attatggtt£ 

ttttatgttt 

gttaattgag 

tattttagtt 

tggtgtaatt 

gtatttgatt 

attattta-tg 

tagttgtaag 

aaagtatagg 

tttagtatag 

ttttattttt 

gaggggttgt 

aaatatattt 

atat'tttttt 

tttggaaaga 

ttagtatttt 

gaggattgtt 



aggaattgtt 

tgagatttta 

gagaggaata 

agattgttgg 

ttttttttat 

ggtagggagg 

aggaaagttg 

gtttattaag 

tattgaattt 

gatagttaat • 

tttttttttt 

ggattggttt 

aaagggtaat 

attatggaga 

aaaatatata 

tattggtttt 

gattggttat 

ttggtaataa 

ggagtatttt 

agaggtggat 

gtgtaatttg 

atattttagg 

ttatttatgg 

ggaaatatat 

ggaggtatgg 

tatatgtatt 

ttttattttg 

attatgattt 

gtatttttta 

.ttttttttgg 

tattgttatt 

taagtgtttt 

atatattttt 

tttttttttt 

tgtttttgat 

gagaaagtaa 

gttttttatg 



ttttgattag 
atttgggtgg 
ttatgtttga 
aaaagtgggg 
ttgagagagt 
gaagttaaga ' 
gtgtagtata 
atgagtttgg 
tttgggttaa 
ttttttattt 
ttttttttgt 
a'gagtatggg 
agtttttttt 
gagttttttt 
.tttgtatgat 
'aagagttatt • 
ttttggtatt 
agtttttatt 
agtattgtgt 
ttatgtgaat 
aaagtttaaa 
tttttaataa 
aaaggtttgt 
ttttagaatg 
taatgttttt 
ttagaaggtt 
taaatttgat 
atagtaagtt 
gtttttaagg 
ttgtg'ttata 
tatttagtgt 
gtttattggt 
tttgtttagt- 
• atttttttgg 
ttagtagggt * 
tttatttttt 
ggggtatgtg 



60 
120 
180- 
240 
^300 
360 
.420 
480 
540 
600 
660 
720 
780 
840 
90Q 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800. 
1860 
1920 
1980 
2040 
2100 
2160 
2220 



31- 



tgtgtttttt 
taggtttata 
ttgaaaattg 
gtaatgattg 
ttatttgttg 
gtatattgtt 
aaatagttaa 
attttgttag 
gtattagagt 
attatagjatg 
tatatatata 
aatttagtta 
tagtttaagt 
attttaaaaa 
tttgtgattt 
gttttggttg 
gtttttttga 
ttaatataaa 
attttgggag 
ggtaaaatgt 
ttttttgtta 
tgtatatttt 
tatgtatata 
■aatagagata 
tttttggatt 
gatttttgtg 
aagtaaagta 
gagtgatgtt 
ttgtaatgta 
gttggtggag 
tggatttgta 
gttgtgttgt 
ttttttgttt 
ttttttatat 
agtttttgaa 
tggatagtag 
ttgtggttgt 
ttgaggttgt 
ttttgagttt 
atggttagta 
ttggtttgtt 
gtttggtagg- 
tgtgggatgt 
ggttttgtgt 
aaatgtattt 
ttaataaaat 
tttaggtaaa 
aaagttgggg 
ttttgtttaa 
agttatgatg 
tttatgtttt 
ttggttgagt 
taggtgtttt 
ttgttagttt 
ttagttaggg 
gagattattt 
ataagaattg 
tttgtttttt 
gtgaatattt 
agggtgttat 
ttggtgtttt 
tttttaggga 
attgaagggt 



ttttttttta 
ttttttaaga 
aaagtgtgat 
ttggttataa 
ataatttata 
taataatggt 
aaatgtaatt 
atattttgtt 
ttattaggtt 
tatgatatat 
tatatatata 
aaaatttttt 
gaattgagaa 
tttgttagtt 
gaggttatgt 
tataattaat 
attatttttt 
tatata'agtt 
tattttgtag 
agtttttttt 
gtttttttag 
attttatttg 
ttatgtattt 
tattggagtt 
tgttttttgt 
tgtttttgtt 
aagtttaggg 
taagttaatg 
tatgagtttg 
ggtgtttgtt 
ggtttttggg 
ttttaatttt 
tgtggggata 
taaagtattt 
ttgtttgtag 
taagtttgtt 
tgttaatgtg 
ggtgtttggt 
gttgatgtta 
ggtgttttat 
ggtattttat 
agggagggag 
gtgatttgag 
gtagtttgtg 
tttatttatt 
atatgttgga 
taaatatggg' 
gtgtttggag 
tgtgtttttt 
atgatttttt 
ggggtttta : g 
ttgtgttgga 
gtgagtttaa 
agtgataagt 
tttgtgtttt 
tgtgatttta 
tttttttttt 
gttttttgtt 
ttttaggtgg 
atggttgggt 
tatatttt'tt- 
tatttatagt 
agattttggg 



ggatttgtag 
gttttatgaa 
ttatggaaat 
agggaaatat 
gttattaatg 
ttatttttgt 
tttagtaata 
gggtgtttgg 
gtagtaaagg 
tttatttatt 
tatatgtgtg 
tttatttgtt 
gattgagttg 
ggattagatt 
ttggtatgaa 
ttgtggaagg 
atatgagaat 
aggttgagag 
aggaagaaat 
atatgtatat 
ttgttaaata 
ttttattttg 
ttagttaatg 
tggtatgggg 
gtttatttta 
ttttggttgt 
aagttgtttt 
ttagggtaag 
ggagattagt 
ttgggattgt 
gtagggttgg 
gggttgtgtt 
tggtttgtat 
gggatggttt 
tttaagattt 
gtgtataatt 
taggtttatg 
tttaatggtt 
ttgtatttgt 
tatttggaga 
aggtatttgt 
ggagggaggg 
ggtgtgtgta 
ttgtgtttag 
tattgtttgt 
aaattagata 
gtgttttgag 
ttagtagagt 
aaattttttt 
gggaagtgat 
agagtgattg 
gtggattgtt 
aataagttat 
ttgttttatt 
gtagttattg 
atggtgaagg 
tttttttttt 
tttttgatgg 
ttataggata 
tttgaattag 
tgttttgtaa 
agaaggaagg 
ggtatttagt 



gattttttgt 
gtgttttttg 
taaattattt 
ttgtgatgta 
ttaaatttga 
aagaattaga 
gtaagtttgg 
gatagtaatt 
aagtttattt 
ttttgttttt 
tgtgtatgtg 
ttttatttgg 
taggattaaa 
gataatgtaa 
aaggttatat 
tatgaatatt 
ttttaatggg 
aattttagaa 
tgaggttttg 
tttgaatttt 
tagttgtttg 
gttatagtgt 
aggaggggga 
tatataaggt 
agtttagttt 
gaaatttagt 
ttgggattgt 
gtaatagttt 
atttaaagtt 
atttgttttt 
ggttagagtt 
ttttttttag 
tttgtttgtg 
tattgtatta 
ttttggagtg 
attttgaggg 
gttagattgg 
tggggggttt 
tgttgtagtt 
atgagtttag 
gtttgtgttg 
agaagggaga 
gggagtttgg 
agttaagttt 
gtgagaggta 
aagtagtagt 
ttatttggga 
ttagtagag;t 
ttatttaga't 
ttttgttatt 
ggagttgaat 
ggtatgtgat 
atggaagtat 
tggggagaat 
tggaaggagt 
ttgtgtgttt 
ttttttattt 



gtttgtagag 
ggtgtatttt 
gtatttttta 
gttgtggatt 
ggattaaagt 
gtgtattttt 



gttatttgta 
tatgtgtttt 
gtaaaaaatt 
tttaatgtgt 
ttttggtttt 
taaaatgtat 
tatttagata 
aaaataaagt 
aaaagtataa 
ttaatatgta 
tgtgtgtatg 
atatttgatt 
ggatagatat 
tataattgtt 
tttatattta 
tatgtgtgtt 
attaagtata 
ggttgtggaa 
gtaggttgta 
tgtttttttt 
tggttggttg 
agtttttttt 
attaaataga 
agtatattag 
tttttgggtt 
ttttatttag 
tttaaattga 
ttggttgttt 
ggaggtttgg 
gttgggttgt 
tgtgtgttgg 
gtggtttgtt 
gttatggatt 
gatttaaggg 
gtttttgggt 
tgttgtttat 
ttttttttat 
ttttttattt 
gttgtttttt 
tggttatatg 
tttgttgggg 
gtttagggag 
ggtgtgtggt 
ttttgttggg 
gatttgaaag 
tatttgtggg 
aggttttgtt 
tttatttatt 
tatttgattg 
tgtttttttt 
gggtttgatt 
ttttgatagt 
aagtgtttaa 
gttttggagt 
gtggttggtt 
ttattttaat 
ttttttgttt 
ggattaggtg 
ggattgggtt 
attgtatatt 
agtttttgtt 
gtagtttggt 
agttgttttg 



tataatttgg 
aaaaaggtat 
gttttggaaa 
ttttaatttt 



agttatattt 
atttgatata 
gattatgaat 
attgatagtt 
attatttaag 
tatatatata 
tttaattttt 
ttgtatattt 
gtagaaatgt 
aaagttttgg 
gttttttgaa 
ttaattaaag 
gtattgtggt 
gggtttattt 
tttttttgat 
tttttagatg 
tgtatgtaat 
agggttattt 
aagagagata 
agaaagttgg 
atttttagta 
tagtgatgat 
gttgtgtttg 
tttagtattt 
gagtttagga 
ttggttttat . 
tgggatatgt 
ggtttttgag 
atgattatga 
aatgagttgg 
gaggtgtatt 
gagtttaatg 
ggttttgggt 
aatagtgtgt 
ttgtagtttt 
gtgtgtgagg 
tggttgttgt 
ttgtgggagt 
ttagtttggg 
tagttgaaaa 
tttgggtttt 
ggaaaatatt 
tttggtattt 
tttttaatgt 
gaaatatgtt 
ttttttttat 
ttggagttag 
tggaaatttg 
aaataatttt 
ggtgtgtggg' 
taggatatag 
tttttttttt 
agtttttttt 
ggtgtttttg 
tggaagtttt 
ggtatttgga 
tagtattttg 
tttagaggat 
gagaaattta 



2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 . . 

2940 ' 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 ' 

3660 

3720 

3780 • 

3840 

3900 

3960 

4020 . 

4080 

4140 

4200 

4260 
4320 
4380 
. 4440 
4500 
4560 
4620 
4680 
4740 
4800. 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400. 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 

.5940 
6000 



-•32:- 



• « 

» _ * 



gagtatttta 
aatttttaaa 
agagaaaaag 
gattaagttt 
gatgtgaatt 
ttagtttttt 
tttggtagat 
gggattgtat 
tgttttattt 
agagataagg 
tttagtttta 
agaattttgt 
gatagatatt 
aaaggaaagg 
agttttgggg 
ttagtttagt 
aaatattaga 



tagttatgta 
ggttggtata 
gatttttttt 
aattttatat 
ttataatttt 
tattgtaaat 
tatttattgt 
tttatttaaa 
tatatttttt 
aatttaaaaa 
gaatgatttt 
ttgttattat 
atttagtatt 
attgattggg 
ttgaaagtag 
gagtttagaa 
gtgtatggta 



gatttaagtt 
ttttaaatta 
gtttattttt 
aatatttaaa 
atgatggatt 
taatgttgtt 
tttgttaata 
attgtttatt 
gtattatatg 
ttaaaaaaaa 
ttttgtgttt 
ttagtaagga 
tttttttttt 
tttttttagg 
tgtttttfaaa 
tattttttaa 
ttttgtaagg 



gtttttattt 
attttatttg 
ttttttttaa 
attgtttgat 
ttagaaatta 
gaattttaat 
attataaagt 
attttagtga 
atatttaaat 
aatttgtata 
tattttttgg 
aaagaagtat 
tagttttttt 
aaattttaag 
ttgtttgtta 
aaggattaaa 
ataagttttg 



aaaagataaa 
ttttaattta 
atggaagaat 
gtgaaggaag 
tttttttttt 
gggtattaat 
gaatttggtt 
taagtggtat 
atttttagaa 
aatgggattt 
attatttttt 
ttatgtaagt 
gtttaaatga 
ttttttaagt 
tgatttagag 
atggaaagga 
t 



taatgaataa 
gggttaaaat 
aaagtatagt 
gtattggtat 
tatttaattt 
gagattgttt 
aaatatagag 
tagtgtaata 
taataaaaaa 
tgtgtggaaa 
tttttttgtt 
t'ttttatatg 
ttttgggtat 
agtttttaaa 
ggttatgaat 
atataataga' 



6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600' 

6660 

6720 

6780 

6840 

6900 

6960 

7001 



<210> 23 
<211> 7001 
"<212> DNA 

<213> Artificial Sequence 



<220> 



<223> chemically treated genomic DNA (Homo sapiens) 



<400> 23 

ataaaattta 
ttttagtttt 
atgataagta 
atttaaagtt 
tagaaaaatt 
gatgtttttt 
tttgggaaat 
ttgtgtaagt 
tgtttgaata 
attattggga 
tattttgtaa 
tattgaaatt 
atggtttttg 
tattaagtag 
tttaaaaaaa 
ataagtaaaa 
tgagtaaaga 
tgaagatgta 
tattttagtt 
tggtttatgg 
ttgggaaatg 
tggggtatat 
tttttgtagg 
aaaatgggag 
aggatatata 
•tgtttttttt 
tatttttttt 
tgtgttttta 
agttatatgt 
tatttaattt 
gggtgatagg 
agtttagatg 
. aattttgttg 
tttttaagtg 
"aattgttgtt 



tttttatgaa 
tttaaaaaat 
gtttgaaaat 
ttttaaaaga 
gagaggggaa 
ttttttgttg 
aaaatatagg 
ttttttttta 
ttatgtaatg 
tgatgaataa 
ttattagtaa 
tagtagtatt 
gagtttatta 
ttttaaatgt 
aaaaggtggg 
ttaatttgaa 
tagtttggat 
tattgaatat 
tttttttttt 
tttataaggt 
tttaatttag 
ttgttttgtg 
tttattagga 
ggagaggaga 
atttttgtta 
atagtggttg 
gggtgggata 
tatggtttgt 
tagtaatttg 
ttaattgttt 
aattgttttt 
aaaggaaatt 
agttttgtta 
atttaaagtg 
ttgtttggtt 



atattatgta 
attttagatt 
attgttttta 
gtttaattaa 
aaatattaaa 
gataataata 
aaaaattatt 
atttttaagt 
tagaaagtgt 
ttttgaataa 
ggtagtaaat 
aatttgtaat 
taaggttatg 
tgtgtggaat 
tagaagaaat 
atatgttaat 
ttgtgtggtt 
ttttagaatt 
tgttataaat 
agaaagatat 
gatttagtta 
gttatttagg 
aaataaaaaa 
gggagaaaga 
ttgaagttat 
tgaggtgtag 
ggtttgttat 
tttaaatttg 
ttttagtgtg 
tttaaagttt 
tagaaagtta 
tggggagtat 
attttaaata 
ttttgtgttt 
ttttaatgtg 



ttttgatatt 
tattaaattg 
gttttaaaat 
tttttttttt 
taatatttgt 
ggtagaattt 
ttaaaattga 
tttttgtttt 
aagatagggt 
gatgtgattt 
aatttattaa 
aaaagaattg 
gaatttatat 
tagatttaat 
tttttttttt 
ttttaaaagt 
gtgggatgtt 
tgttttttag 
atttttggaa 
agggatatta 
tgtggtgttt 
aaggtgttta 
taaaaaataa 
ataatttttg 
aaagtggttt 
attttggttg 
tgggttggta 
tgggatattt 
gatttagtta 
taggatgtgg 
ttattatagt 
attaaataaa 
tttttaattt 
atttgtttgg 
tgttttgtta 



ttttattata 



agtttatgat 
ttttgagaat 
tatatttaga 
ttatatgaag 
taataaaaga 
atttttatat 
tttttttatt 
atattatatt 
tttttgtatt 
ggagtagttt 
aaaattaaat 
tatattagtg 
tgttgtattt 
tgttttaatt 
tttgtttatt 
ttaaattttt 
tattttttgg 
atagaatatt 
gtttggatat 
tgaagttttt 
ttagaagtgt 
aaggagaaat 
tagggaaaaa 
tttgtgtttt 
atttgtatgt 
ggagattatt 
ataaattttt 
gttaattttg 
ggtggggagg 
tgatatattt 
aatattaaaa 
taaatgttaa 
aggtgttttt 
ggaagtttgg 



ttttttttta 60 

tttttgggtt 120 

tatttaagaa 180 

gttatttaag 24 0 

aatttgtata 300 

ggaaagataa 360 

agagttttat 420 

attttaagag . 480 

gatattattt 540 

tgattaggtt 600- 
tattagtgtt . 660 

agggaagaaa 720 
ttttttttta .- 780 

tgttttttta 840 

ttaaattaaa 900 

gtttgttttt 960 

ttaaggtggt 1020 

ggttaaattg 1080 

gaataaaaat. 1140 

tagtgtatag 1200 

aaatttagtt 1260 

ttatttaatt 1320 
tgggtaagag ' 1380 

aattaaaatg 1440 
ggattggttg - 1500 

tattttgggg 1560 

tttaagtatt 1620 

ggttgttaga 1680 

aaattagatt 1740 

aggggaaagt 1800 

ttaattaaat 18 60 

ggataaataa 1920 

gagtgagatt 1980 

t-tttataaat . 204 0 
gtttttgggt - 2100 



us- : 



• • • ••« 

* • * 



ttgttttttg 

agaatttgat 

tttgggtttt 

tttttttttt 

gtggtgtggg 

gttgggtttg 

tagttgtggt 

gaaatttttt 

gttggtttga 

gtttttgggg 

ttgttttagg 

ttgatgtagt 

ttgtgggtag 

tttggaaaaa 

gagttttggt 

tgggagtaag 

taattttaag 

gggattgttg 

agtgatttta 

ggttgagttt 

aagattgggt 

tgttttatgt 

tttttttttt 

tatattgtaa 

atagatagtt 

ggagatttag 

ttaggatttt 

tttttgagat 

ttttattagg 

gggtgtttat 

aatgtgattt 

gttatattgt 

ttttagtttt 

tttagatgaa 

gtatatatat 

aggagataaa 

tgaataaatt 

taattgttat 

gttaagttta 

atttaatttt 

attgagttta 

gtgtattgta 

tagatgattt 

gtaaaaagta 

tataaagaat 

tagtaatttt 

taaaatgtta 

gtttttttag 

ggaaggaatg 

tggatgtgtt 

ggtagttttt 

ggggagtgag 

gttgtattgg 

ttttatgttt 

ttttatggtt 

atatgggtgg 

ggattaaatg 

tgattgtatt 

tagttgaaat 

gtttagttga 

tgaatataag 

ttgattgtga 

ttgattattt 



tatggatggt 

tttgaatgta 

ttgtgtgtat 

tttttttttt 

tgtgggtatt 

ttttttaggt 

ggtgggtgta 

aggttgttgg 

ttgtagattt 

tagttgtata 

gggattttga 

agggttattt 

ggtgtagatt 

gagtatagtt 

tttggttttg 

tgtagtttta 

tattggtttt 

ttttgttttg 

aagagtagtt 

tatggttagg 

ttaaaataaa 

gttttgtgtt 

ttgttggtta 

ttgagatagg 

atatttagta 

ggtatatata 

agtttttttt 

ttttttagtt 

aatttttatg 

gttttttata 

ttttatatta 

tggtttggtt 

ataatttgat 

aaataaatag 

atatatatat' 

aaataggtga 

ttttttgttg 

tttaaatatt 

ttgttattaa 

tataaaiaata 

gtattaatga 

gatatttttt 

aatttttata 

ttttataggg 

tttataagtt 

tatttttttg 

aagttataga 

aaatgtttta 

tgtttgtatg 

tgtgtgtgga 

tttggtagat 

gggagatttg 

ggaggagaaa 

tttgttgtta 

aaaatagaat 

tttttttgaa 

ttttattggt 

aggaaattag 

gtaatatttt 

ttgttttttg 

ataaaggtag 

ttgggaatga 

ttttaattat 



aagtgggtgg 

gtgtgggttg 

ttttgggttg 

tttttttttt 

tgtagaatgt 

agtagggtat 

gtagtattag 

agttgaatgt 

gtgtgttggt 

tggtgggttt 

gttgtggatg 

tagatgtttt 

gtgtttttgt 

tgaggttaga 

ttttgggagt 

ggatgaatgt 

ttgagtttat 

atattggttt 

tttttgaatt 

gggtgggggt 

tgtgaaagat 

agattttgat 

gaaatatgta 

gtagatagaa 

gttgggggaa 

taggaagagt 

tttgtaaaat 

tgatttgtgt 

tgaaggatga 

ggttggttat 

aatataattt 

tagttaatag 

ttttttggtt 

aaaaaagttt 

atgtgtatat 

agtatattat 

tagtttaata 

taatagagta 

gagttatatt 

gattattgtt 

ttatagattg. 

tttatggtta 

aattatattt 

tttttaaaaa 

ttgaaagaaa 

ttaggatata 

gagagaaatt 

tgggtttgta 

tgtgagtggt 

atggtaggga 

atgtagttgg 

ttgtttgttt 

ttttagtttt* 

ttttaagtaa 

gttagttaaa 

agaatataat 

tttagttttt 

attttggaat 

tttattttgt 

taaatatgtg 

atatttttgt 

ggttttttat 

ttttttggtt 



agagtatgtt 
tatgtagaat 
tgtgttttgt 
ttttgttggg 
tggtgggttg 
ttgttggttg 
tgggtttgga 
tgtagtttta 
ggtggttgtg 
gttgttgttt 
gtttaggggt 
ggtgtggagg 
agggtagaag 
ggtgatgtag 
ttgtgggttt 
tttttgttag 
atgtattata 
aaatattatt 
ttattttatt 
gtatgaggat 
ggatttaggg 
atatttttgt 
gtgtgtatat 
tggggtgtgt 
ttgatagggg 
tgtattttgt 
gtttttaaag 
gtttatgttg 
tttagaaaaa 
gtaattaaaa 
taggttatga 
atttttaaaa 
tatttgggtt 
ttaattgaat 
atatatatat 
gtgtttataa 
gattttgata 
tttgatgaag 
tttgattatt 
ggataatatg 
ttagtaaata 
gtaattatta 
ttaattttta 
atgtgaattt 
aaggagatat 
gatattagtt 
aaaataagtt 
gaagttaagg- 
ttagtgtgtg 
tttgggaagt 
gttattgtat 
atagggagta 
tttgttttta 
gaggaaaaat 
agtgtatgga 
ttaataataa 
agtttttgaa 
aaatgttttg 
taattaaatt 
aaaaatatgt 
tttaaattga 
tttttagtgg 
atagtattaa 



ttttttagtt 
ttttgggttg 
ggtttttgta 
tgtggtggtt 
gttttgtgta 
tggggttgta 
gatatgttgt 
gatttggggt 
gtgttgaatt 
aggtatattt 
tttagtttgt 
gttatggtta 
gtttagaaat 
tgtatgtttt 
ggtgaagttg 
tttttgggtt 
aaggtgttgg 
ttaggtataa 
tgttgttgtt 
ttgttaaagg 
gttggttttt 
ttgttttttt 
aggatga'ttt 
ggttgtatat 
gtatttgagg 
tattaggaga 
tagatagatt 
gattatagta 
tttttggtta 
ttttagaaaa 
attaaagttt 
tgtatttttg- 
aggatatgta 
taaaagttaa 
atatatatat 
ttttggatag 
taattattaa 
tgtttatggt 
ttatattaag 
taaatgtagt 
aagggttaaa 
ttttttaaag 
aatgtttttt 
gttaaattat 
atatatattt 
agaaaggtaa 
tattttgttg 
gttgagagag 
aattaggtag 
agttagtagg 
agtgttggat 
gtgtggtata 
tttttggagg 
aggtttgttg 
tattaagttt 
aagttttttg 
tagaggtatt 
gtattttagg 
tataaatttt 
• attattaaaa 
ttttaggttt 
tgtt'tatatg 
tatgtaatat 



gtttggtgag 

ggttgtgtgt 

gttttttagg 

attttgatgg 

ttgtgtagtt 

ggaaaggtga 

tgagtggggg 

tgtaggggag 

tgtaggtggt 

tgtttagggg 

ttttttggat 

tggtttgtgg 

tggtgggtta 

gttgatatgt 

ggtgatttga 

tttgggtttt 

aggatggtta 

tttgatttgg 

gttggataga 

tggtttaggg 

tttaatgtgt 

ttttgtttga 

tggggaggat 

gtagttagtt 

ggaagggggt 

atgtaatttg 

tttttataat 

ttgtatttgg 

gggtgtatat 

ttgaatataa 

tggtaattat 

tatgtttatt 

gaattaaata 

atatgtatat 

atatatatta 

tttatatttt 

tattttgttt 

ttatttaaat 

tatatatttt 

tgaagttaaa 

aatatattag 

taatttttta 

taaaatatat 

atgtaaatgg 

ttatggagaa 

gttgtttttt 

gattaagaat 

tgagaaggaa 

agagagtgtg 

tagggtattt 

gaatggtagt 

gttagagaaa 

ttggaaagta 

tgaattatag 

ataaaatagg 

ggatatatgt 

gttatgtttt 

gatgtgtttt 

tttatgaata 

gtttaggata 

ttgagttgta 

gatttatttt 

tgaggtgttt" 



2160 
2220 
2280 
2340 * 
2400 
2460 • 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 " 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
'5340 
5400 
5460 
5520 
5580 
5640 
. 5700 
5760 
5820 
5880 



34- : 



ttagagtgtt 
gatattagtt 
ttttggtttt 
ataaaaaaga 
ttgtgttttg 
ttagtattga 
tgttggtgat 
ttagaaaaaa 
aaggaaaggg 
ttttttatag 
aaaggtagtt 
ttttgttttt 
ttttttgttt 
ttttttttgt 
ggtaaagatt 
tgttgggtat 
taattttttt 
attgtttttt 
taagtttttt 



tatgtttagg 
attattttgg 
gggatattat 
aatagaaagt 
aaagattttt 
aaataaatat 
agtggtgttt 
gtaagagtat 
gatttgtaat 
gttttgattg 
tttttaatta 
gggtttggtg 
ttggtaaaaa 
tggatatata 
gatttggttt 
tgtaggtttg 
gtgtttgtgt 
ttgagggttg 
agttttataa 



gttttaggat 
atattaaatg 
gtagttattg 
tataaaaatg 
agtattgtag 
tggatataaa 
tttgUggtt 
ttagggaggg 
tgtttattat 
tttagagatt 
tttttttttt 
agtatgggtg 
ttttatttgg 
tgttttagtt 
ttttgttgaa 
tttaaatatg 
ttagtattat 
attttttttt 
tttttttttt 



atatgatatt 
attttaatta 
agagatttat 
ttaatgatgt 
tgtttgagta 
taaatattga 
aataatggtt 
aggagagagg 
tgatatagga 
tataattaaa 
agtatttttt 
ggaaagtata 
gtttttagtt 
gtgatttatt 
tgtgttttta 
agttattatt 
tgttttttta 
gttttttata 
tttattgtat 



taatggaggt 
taatgttagg 
gagtggtttt 
tattatgtaa 
taggagagtt 
aaagaaagat 
atttatgttt 
aataggggaa 
agaataagaa 
gttagtttaa 
tttaaat'tta 
ttgtgttatg 
tttttggttt 
ggttttttta 
gttttatttt 
tttaaatatg 
gttatttaga 
tttaattaag 
t 



tttgttgtta 
a.gtggttggt 
tgagattagt 
atatatgttt 
ttttttatag 
tgttattttt 
tagattagtt 
aggagaagga 
ggttagttgt 
gaagtttagt 
ttttggtgag 
ttgatttttt 
ttttttttat 
ggtgaagaag 
tttagtggtt 
gtgttttttt 
ttaaaatttt 
aggtaatttt 



<210> 24 
<211> 11001 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 24 

ttaagttaga 
attttatagg 
ttttttatgt 
taaagtatta 
gtttggaagt 
ttgtttttta 
ttagtgataa 
tttttaggat 
gttgtttatt 
tagggttagt 
. tagggtgggg 
ttgtgggggt 
tttgtgtgtg 
tggtgtttga 
tttttgttgg 
gtgtaagaat 
ggtgagatgt 
• tttaagataa 
tttttatgtt 
ttgggtattg 
ttattttata 
tagggttttt 
tttgtttttg 
ttttgtatgt 
gtttagggtt 
ggtatatggt 
taaaattagg 
taggaggatt 
atttttatta 
aattgtttag 
agttattatt 
ataaaaaata 
ggttgaggtg 



tgttttttaa 

tttaatattt 

tattagatgg 

aggtaatttt 

ttttgtttta 

gtataatttt 

ttaagaaatt 

aaagtttagg 

atttaatagt 

tgtttagttt 

tgggaagttt 

tatttattat 

atagtatatg 

tatttgtgtt- 

agggtataga 

tgtttttgtt 

tagtttattt 

ttttagttgt 

ttttttttga 

tattttttga 

ggtttgtttt 

ttgaaaaatg 

atgtatatat 

tttgggtgga 

ttagtattga 

taggggttag 

ttgggtgtgg 

atttgatttt 

aaaaaagaaa 

gaggttgagg 

gtgttattgt 

aaaataggtt 

ggtggatta't 



ttatttgtgg 

tggagttaga 

atttttgtat 

ttttggtttt 

gttataatgt 

ttttttaagt 

aagaattatg 

gttttgttat 

ttttgtaagt 

ttgtatgttt 

attttggttg 

taagaatttt 

taggtgtttg 

ttgttttggg 

gatagggtgt 

ttaaagaaat 

ggggttaggt 

gattttttga 

gtagttaatt 

ttttttgttt 

gtaagatggg 

gtttttgggg 

atatagttgg 

taagggaaag 

atgtagtttt 

gaatagttga 

tggtttatat 

taggagtttg 

aaaaaaaatt 

tgggaggatt 

attttagttt 

gggtatgttg 

ttgaggttag 



ataggttagg 

aagtttttag 

ttttagaaga 

ttgggataat 

ttatatattt 

tttattgtag 

tatttatgtt 

tgggttttgt 

atattgtgaa 

aggtgtatgt 

tgatgtttag 

tgtagtaggg 

atagtatttt 

ttttgttggg 

attagtttta 

agagaaatta 

tgggtatgta 

gtattaggta 

tattaagttt 

aattttagtt 

ttagtatgga 

tagtttttgg 

tgtttatttt 

atagaattat 

aaggatatta 

gggattttga 

ttgtaatttt 

agattagttt 

agttaggtgt 

gtttgagttt 

ggggaattga 

gtttatgttt 

gagtttgaga 



tatattt'tga 
gtaaaaagtt 
ttttttatat 
tttaggtttt 
ttggaattgt 
ttatagttta 
tattttttta 
taggagttgt 
ggggaagtaa 
gtatatattt 
tgtgtttaaa 
ttttaatgat 
tgggtaggta 
tattttgtaa 
gttgaatttt 
aattttgata 
taaattattg 
tatagttggg 
atgaagaggt 
tgagaaaggt 
tatagggttt 
aaattgattg 
•gaa'tttatta 
ttggtttttt 
tagaagtagg 
agagggattt 
agtattttgg 
gggtaatata 
ggtggtgtgt 
gggagattgt 
gtgagatttt 
gtaattttag 
. ttagtttgat 



gtttaatttt 

tgaagggggt 

taggaaagat 

ggtatgagtg 

tttgtagggt 

ttagttttgt 

gagttatttt. 

agttgtaggg' 

tgattagaga 

ttatataggg 

gagtgtaaat 

tttatgtgtt 

aaaggaagtg 

ggagggtggt 

gatgaagttt. 

ataggtttta 

tttgtgttta 

tatttgtttt 

tgttgttgat 

taggtgtttt 

ttgaggaatt 

tttttggttt 

ttgtttttgg 

tttgttgttt 

ggtaattgaa 

ttatttaaag 

gaggttaagg 

gtaagatttt 

ttgtagtttt 

agttatagta 

gttttaaaat 

tattttggga 

taatatggag 



5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480- 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7001 



60 
120 
180 
240 
■300 
360 
420 
480 
540 
600 
. .6.60 . 
720 
780 " 
840' 
900 
960 
1020 
■ 1080 
1140 
1200 
1260. 
1320 
1380 
• 1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
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aaattttgtt t.ttattaaaa atataaaatt agttaggtgt ggtggtatat gtttgtaatt 2040 _ . 

ttagttattt aggaggttga ggtaggagaa ttgtttaaat ttgggaggtg gaggttgtag 2100 

tgaattgaga ttgtgttatt gtattttagt ttgggtaata .agagtgaaat ttggttttaa 2160 

aaaaaaaaaa aattagtaaa attatatttt aattgtatat tttgattata gtattttagt 2220 . 

tgagttggag tgagggtttg ttttggagaa ggtagtttat tttttttttt tgttttggta 2280 

tggggttatg atttattgta gggtgagagg agtggagagt ggtgtatatt agtagtttag 2340 

ttattagtgg atagagtagt atttggagtt agttttttat gttttatata tagtgagaaa 2400 

aattattgtg atatgatgtt taattttgat ttaagttgta taaaaggtag ttttaggtta 24 60 

ggttttaatt tgttagaggt atataggtag ttttttggtg ggtttttgta tttgtttgtg 2520 

ttgtttggag atttggttta aagatttttt ttttttttga gatgaagttt tattttgttg 2580 

tttaggttgt agtgtagtgg ttggattttg gtttattgta agttttgttt tttgggttta 2640 

agtgattttt ttgttttaga tttttgagta gttgggatta taggtgtgtg ttataataat 2700 

atttggttaa tttttgtatt tttagtagag atgggatttt attatattgg ttaggttggt 2760 

tttgaatttt tgattttaag tgatttgttt gtttttattt tttaaagtgt tgggattata 2820 

ggtgtgaatt attgtatttg atttagagat ttttaatttg attatttatt ttttatttta 2880 

tttagggatt ggatttttgt tggaagggtg gagtgtggga tagggtagtt agggttttga 2940 

attgattttt. tttttttaga tttttttggt tttattgtat tagttttatt ttttgttgat 3000 

gttagatagg ttttagttag aatgtgagtg ttatagatat agttaagttt agtgttgatt 3060 

aatattttgt tttagaagaa tttttataag gttttttgta gaatgatttt gtgtttagtt 3120 
taggagagtt agggtttttt ttgattttgt tttggagttt ttttaagtat ttaaattatt ■ 3180 

tgatggggat aaatggagag gatagatgag ggagtagggt ggagtgtttt agtagaatgt 3240 

tttttattta gaatttgttg ttattttgta gttagtaagg atgtggggtt aag'aattaag 3300 

gttagggttt tataggaaaa aggtaaaggg ggaggggtgg gaatttaagt: ttattttttt 3360 

ttttaagtat ttaaaggttt tttggatgga gaagagtatt ggagtaaaaa ttttagtata 3420 

aattttattg gggatagtgg gtaattttgt tgggttagta aaaataaatg gtgtgggttt 3480 

tggaaaatga'gggttggagg ttgtgaataa agtagtggat gtgtttgttt agtatattaa 3540 

tgggaagaag tatttagatg ggaggagtat taggggtagg agaaatgtta gatagatttt 3600 

agtgttaggg taagaaggaa gattattttg tttgtagaat agggagggta tagggatggt 3660 ' . 

gttaatttgt ttttgtgatg gttttgagtt tttatttaat aatgagaaag tttgtttttt 3720 

tttttttttt tggatgattt aggagttttg ggttgggatg tagtgatttt atttttagtt 3780 

tttttttttt tggtgatgaa tttttttatt tttatttaga aaatagattt. ggattagagg 3840 

tattgtatag tttttttagg attttaaagg aggaagagtt tttttttttg tttttaaagt 3900 

' tgtttgttgg aagaggattt taatagttat tttagttgga tgtatagtag gattatggaa 3960 

tttttttttt- gtattatagg gatttat'ttt ttattttatt attgtttata aaaattgatg 4020 

gttttttttt tgagatagag ttttgttttg ttttttaggt tggagtgtag tggtgtgatt 4080 

ttggtttatt gtaatttttg ttttttgggt ttaagtaatt ttttgtttta gttttttaag 4140 • 

tagttgggat tataggtgtt tgttattata attggttaat tttttgtatt tttagttgag 4200 

atggggtttt attattttgg ttaggttggt tttgaatttt tgattttatg atttatttat 4260 

tttggttttt taaagtgttg ggattaaagg tgtgagttat tgtatttggt ttaaaattga 4320 

tgtttttttt.ttttttttta atatataatt tgggattttt tagtttttta tttttttttt 4380 _ 

tttttttttt' ttttttttga gatagagttt tgttttttta tttaggttgg aatgtagtgg 4440 

tttagttttg atttattgta atttttgttt tttgggttta agtgatattt ttgttttagt 4500 

ttttttagta gttgggatta taggtatata ttattatggt tagataattt ttttgtattt 4560 

ttagtataga tggggttttg ttatgttggt ttggtaggtt ttgaattttt ggttttaagt 4 620 

gatttgtttg ttttggtttt ttaaaatgtt gagattatag gtatgagtta ttaagtttag 4680 

tttttttttt tttttttgag atagagtttt attttgttat ttaggttgga gtgtagtggt 4740 

atgattttgg tttattgtaa tttttgtttt ttggttgaag tgatttagtt ttttaagtag 4800 

ttgggattat agttatatat tattatgttt ggttaatttt tgtatgttta gtagagatag 4860 

ggttttatta tgttggttag gttgattttg aatttttgat tgtaaatgat ttatttgttt 4 920 

tggtttttta' aagtattggt attagaggtg tgagttattg tatttggttt ttttttttat 4980 • 

ttttgagata gagttttatt ttgttattta ggttggagtg tagtggtatg attttggttt 5040 . ■ 
attgtaattt ttgtttttta ggtttaagtg atttttttgt tttatttttt taagtagttg • 5100 ■ 

ggattatagg tgtgtatttt tgtggttagt tttttttttt aattggttag tgttttgtgg 5160 

tttttttatt tttttatagt ggaaaatggt . ttaggattga ttgatatgaa gataagttta 5220 
ggggtttata tttaatttaa tttttgtatt taagttttgg gttaagattt tggtgtgttg • 5280 

agtattattt attttgtaag gaattttgta aaattttatt tgaagtatta tttataattt 5340 

tatttttttt atttaaataa ggatttttgt ' tttatttttg ttaggtatat tgagttttat 5400 

agt'ttttgtt tttttttttt ggtgtttagg tttggttttt tgagtttggt ggttatatta .5460. 

' atggtatttg gtatatagtt ttttgataat ggggatattt aggaggtttt gagatatttt 5520 

atagttttgg gttagtaatt. tggatttttt tttttatttt tttaggtatt ttataattta . 5580 
gttttttttt tttttgtggg taaagtgttt ttgaatgttt atggtttaaa ataagatttt ■ 5640 

ttttttattt atttttaaat ttttttttag atttatttta gaggaaggga atagaatttt 5700 
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ttatatttta 

tatataggag 

tattttttaa 

gagggagtag 

tttagttata 

ttgtgtttaa 

tggaattggg 

ttatgtagtt 

ggtggagttt 

ttttggtgtg 

tttgagtttt 

gaaggtgttt 

agttgttaat 

tttgggttag 

tttgttttat 

attttattgt 

gttagtgttt 

tgggaagtgt 

ggagtatttg 

ttttgtgttg 

tttgagaatg 

gtgtttatta 

taatagttta 

gagaggtttg 

ggaagtttta 

ggttatttta 

tggaaggtta 

tttttttttt 

ttatttaggt 

ttaagtgatt 

gtttagttaa 

tttaaatttt 

ggtttgagtt 

tatttgaagg 

ttattttttt 

ttttttattt 

tttagtgatt 

tagtattatt 

tttatttttt 

atgatgaagt 

agatttatgg 

tattaggatg 

tagttttatt 

tagttattgg 

taaat'atgtt 

tgttgtaatg 

attttgttgt 

ttgtgtttaa 

tattatgttt 

taggttggtt 

gggattatag 

attttgtgtt 

atgtatatat 

atgtttaatt 

* tagggtttat 

• ggattgtaaa 
gtttgttggt 
tt'agtatttg 
tttgtttgta 
taattttttt 
ttgggaagat 

. ttttaaagat 
atttgtggtt 



gtagttggtg 

tgtagatgga 

tttttggtga 

gagtatttga 

gttggataat 

tttgtttgtg 

gttttatgta 

aggagggtgg 

tagtgattgt 

gtttgtgagt 

tttggttgta 

ttgttttgtt 

tattgttttg 

agttgtgttg 

tggggtagga 

tttggagttt 

gggtaagtgt 

gtgtgtgtgg 

tggagttggt 

ttgggaaatt 

tttgtattgt 

tgtttgtgtt 

ggtagatgag 

gtggtgaggt 

gggagagggt 

tattgttatg 

taggtagaga 

ttttattggt 

tggagtgtaa 

tttttgtttt 

tttttgtatt 

tgattttagg 

attgtatttg 

aaaaattaaa 

tagattaaga 

attttagtta 

tgtgtattat 

tgaagtttgt 

gtagtttttg 

ttgggtttag 

ttgaaatatt 

tttaaaagat 

gttatatttg 

agataaaagt 

taaattattt 

ttttagtttt 

taaggttgga 

gtaatttttt 

ggtttatttt 

ttgaattttt 

gtatgagtta 

ttttttagtt 

agatttggga 

tagattttat 

ttttagtagt 

agtatttttg 

gatttttagg 

atttattggg 

gggggtttat 

ttattttaaa 

ggattttgag 

aagaaattta 

tagatatgag 



ataggttaga 

ggagtaggat 

ggattttttt 

atgtggaaat 

gtttggttta 

atagaaatta 

tggttttgtg 

ggttgtttat 

tttttttttt 

tagtgaggtt 

gtagttattg 

tatagttgta 

gttgtgtgtg 

taattgttaa 

atgttggggt 

tatttataga 

ataagagtgg 

agtttgggtt 

aatttaggtt 

ggtaattaga 

ggtttgttta 

ttgttttatt 

tgtgtggtag 

ggaggagttt 

tgttgtttgt 

tggattttaa 

ggtttgttta 

aattaaaaaa 

tggtgtgatt 

agttttttga 

tttagtagag 

tgatttattt 

gttttttatt 

tatttttaat 

tattggtttt 

ggtttgtttt 

taaatggaat 

tttttttgat 

tttagttttt 

ttttgattat 

gatttaaatt 

gttttaagtg 

tttagttttt 

ttaaatttat 

tgttttttat 

gttttttttt 

gggtagtggt 

tattttagtt 

tttttatttt 

gattttaggt 

ttgtgtttgg 

gaatttgtga 

tattatttat 

gttataataa 

tgatttaata 

gtgattttag 

ggtgtagaag 

ttagaaaagg 

taaaggtttt 

ggttttggag 

gatagtaggt 

aaattagtgt 

tggagtgtga 



atagggaaga 
ttttttttgt 
ttttttatat 
tgaggtgtta 
ggtttattat 
agggggtttt 
ggtaggtttt 
tgatttagga 
gtttgttggt 
tgtgtggtga 
tttgttgtgt 
tttgtttgtt 
tgtgtgtatg 
gattgaaatg 
ggggatatgt 
ttttatttat 
tttttggttg 
agtttgggtt 
ttttttttag- 
atgttttttg 
tttttgtttt 
gtgggttttt' 
tggtagtggt 
taggttgggg 
taggttttgt 
tgttggttat 
atagttggat 
ataataataa' 
ttggtttatt 
gtatttagga 
atggggtttt 
gttttggttt 
gggaatgtat 
ttatgtttgt 
tatatatttt 
tttttttagg 
ttagtgtttt 
ttttaggggt 
ttgtagattt 
tggtttggtt 
tttagattag 
aatatggtta 
tttttgtagt 
ttttgatttt 
ttttatggtt 
tttttttttt 
atgattttgg 
ttttgagta'g 
ttagtagata 
gatttatttg 
ttttttgttt 
tgtttttttg 
ttataagttt 
tttggggagt 
ggtttgtggt 
ttggtgttta. 
gattgttggg 
gtgtttgtta 
aaattttttt 
ggggttatga 
ataaattttt 
atttttatat 
taagggataa 



ggtgagggtt 
tttttaagtt 
ttttttttag 
gtttaaattg 
aagttatata 
ggtatttagt 
tggttaggat 
tgtggtaatg 
attttttggt 
agtattgttt 
tgttttaggt 
ttttagtttt 
tgtgttagtg 
tagattgttg 
atgttttgtt 
tatagggaat 
gaggtgaggg 
tgggtttgtt 
tttttgtgta 
tgtgtggtat 
ttttatatgt 
agtttaggtt 
aggtgaattt 
aaatgttttg 
aggtttgatt 
ttgggtgttt 
ttttattgtt 
aaaattgtgt 
gtaatttttg 
ttataggtgt 
attatgttag 
tttaaagtgt 
atggaatata 
tttgtggttg 
aattttttgt 
aaattgtttg 
attttttatt 
tatatttttt 
tgatttaatt 
tgtttatatt 
attttttgtg 
aaatttaatt 
aaaaatggtt 
ttttttgttt 
attttatttt 
tttttttgag 
tttattgtaa 
ttgggattat 
tgaggtttta 
tttttgtttt 
attttttgta 
tattggatga 
ttaaataggt 
ttttaaaatt 
gggatttagg 
tttaggggag 
tgtgtggttg 
aataaagatt 
aggttttttt 
gtgtttgaga 
ttttaagaag 
ataagtagtt 
attatttttg. 



tagttggttt 
tttttaaata 
tttttttaag 
tttggttggt 
gttgtttttt 
atttaggtgg 
ttgtggggag 
gattggggag 
ttttatttgg 
gagttttgag 
tattttgaaa 
gtgtgtttgt 
tgtgtgtgtg 
ggatttagtt 
tttaggaatg 
gggggtgggt 
tgggaaggtg 
tgtagtgggt 
gaattagttt 
ttaggtagtt 
ttttggtttt 
ttggggtttg 
gtaatttgta 
gagattgaag 
tattttagtg 
ggaaattggt 
tagtatagaa 
tttgtttttg 
ttttttgggt 
ttgttattat 
ttaggttggt 
tgggattata 
'tttgtttatt 
ttatttgttt 
ttttattttt 



ggatagggtt 

tttatttttt 

tagttttttt 

tttatatttt 

tatttgtttt 

ttagtatttt . 

tttttttttt 

attaggtttt 

ttatttttga 

tttttgagaa 

atagagtttt 

tttttgtttt 

aggtatttgt' 

ttatgttggt " 

ttgaagtgtt 

ttttgttata 

gagggttttt 

tagagtagtg 

tattgatgtt 

ttagtgggga 

agtaattttt 

tgtgtatatt 

taataaaatt 

ttataggtgg 

agaggtaagt 

ggttaaggta 

atttttgttt 

tgtatttttt 



5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 " 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
'7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 . 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000' 
9060 
9120 
9180 
9240 
9300 
• 9360 
9420 
9480' 
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agtgatgggg 
ttgtgatttg 
gtgtggggtt 
ttgttataga 
tggttaggga 
ttggtatgtt 
agtttaatga 
gggtgttagt 
tttaggattt 
gtgtagggat 
tttgtttttt 
tggtgattgt 
tagaggaaga 
atatttggtt 
gagggagatt 
ttggtagagg 
gtatgggtgg 
taggatataa 
tgggtattga 
ttttaggata 
aagtttttgt 
ttagtttgta 
ggtgggggta 
ttataggttt 
ttggagttgg 
ttagattttt 



tgaaagtaat 
atttgtggtg 
attatttatg 
ggttgtatta 
ttagttgttg 
ttagttgggt 
gtttaggttt 
aagtgtgggt 
agattgggtg 
tgtggggttt 
tttttttttt 
aaagtattat 
gggtttttat 
tatttattgg 
ggttagtagt 
ttggattgtg 
ttggttttta 
gtttttttat 
ggatggtaat 
tttttatttg 
attgtagtaa 
ggttttagtt 
ttttatttat 
tatttttgtg 
tgtttggtaa 
gggatttagg 



ggatttagtt 
attgtgttgt 
tgtgtattgt 
tgtgtgtgtg 
tgtggggttg 
gtggtttttt 
ttttgatatg 
tgggtggggt 
gtgggaagga 
gaaggtgggg 
ttgttttttt 
gtggagatat 
atttggtttt 
gtggggtagg 
tttattgttt 
attttgaggt 
gtaattgtag 
gtttttttta 
gtatgtgggg 
tagtaatatt 
gtatagtggg 
tatttggggg 
atttatgttt 
tttgtagttg 
ttgtttgtta 



tttgggagtt 
tttttggttg 
aggtttttgt 
gtgggttttg 
tatgtggtgt 
ttagtgtgtt 
gtttggttgg 
tatagggtgg 
gttgaggaga 
ttgggtgtgt 
tttatatttt 
ttgtgttttt 
ggtttttttg 
aagttttgag 
gttttgtttt 
ttaggggtgt 
tttttatttg 
gtgtttgatt 
gatgtgggag 
atttattttg 
gttgttttgg 
aaagttagga 
tttgtatttt 
taggttttgt 
tttgtttttg 



gtttttgttg 
ttttttttgt 
gtatgatgtt 
tgggttggaa 
tttgtgtgat 
tagtgggtgt 
gtttgtgttt 
gttttgattt 
gttgtgtaat 
ttttgtagag 
attttggatg 
ggaggttagt 
gtttggtttg 
tttttatttg 
ttattgtgga 
attttgggtg 
gttttgttat 
tggtattttt 
tagggtttag 
gtattgtgag 
agttattgtt 
aggtttgatt 
ttttattttt 
tttgaggggt 
tttttttgtt 



attttttttg 
ttttgtaggt 
ttagatgaag 
gtggtggtta 
gtgtagtgtg 
tagtttttgt 
tgttggtttt 
tagtgttttt 
ggaaatttgg 
ttttttttgt 
gttataatga 
tttattgtgt 
ttgaagtaat 
gggtgaggag 
gattggggtt 
gatttttttg 
tttgggttgt 
tgtaggtagg 
aggtttaagg 
tagtgtttag 
tttagtatat 
ggttttggaa 
tttgttattt 
tgaatatatg 
ttagttgttt 



<210> 25 
<211> 11001 
<212> DNA 

<213> Artificial Sequence 
<220> 



<220> 

<223> chemically treated., genomic DNA (Homo sapiens) 



<400> 25 

ttttagattt 
attgttaagt 
gtgtgaaggt 
tgtgggtggg 
gggttgagat 
tttattgtag 
gtaggtgagg 
tattgttatt 
tatggaaaga* 
ttgagggttg 
ttatggttta 
agttgttgat 
tttagtgggt 
tgtggggatt 
tgtggtgttt 
aaggagggaa 
tgggttttat 
ttgtttagtt 
ggtttgtgtt 
ggagtttgag 
tatttggttg 
atggtagttg 
gtggtgtgat 
gtgtgagtag 
ttgttgtggg 
tgttattttt 



tagaaatttg 
attagtttta 
aaggtttgtg 
gtgtttttat 
ttataggttg 
tgtagaggtt 
gtattttaga 
tttaatattt 
tttgtgtttt 
attatttatg 
gtttttgttg 
tgattttttt 
aggttaagtg 
tttttttttt 
tgtggttgtt 
ggaagggtaa 
agtttttgta' 
tggattttgg 
tgttagtgtt 
tttattgagt 
gagtgtgtta 
gtttttggtt 
ttttgtggtg 
tggttttgtg 
ttaagttgtg 
gttttattgt 



ggagtggttg 
gtatgtgttt 
gaaatggtag 
•tfetttagggt 
gatgtgttag 
tttaagtgtt 
attttggatt 
atttgtttgt 
ggtagtttag 
ttaagggaat 
gagttttagt 
tttttttatt 
tgttgtttta 
agtatagtaa 
gttgttgtgg 
ggtggggggg 
tttaggtttt 
gggaggtgtt 
taaagttagt 
tgtgggagtt 
atgtgttgtg 
gtggttattg 
attttatttg 
tgtttatgag 
gtagaggggg 
tgaagagtgt 



gagtgagaaa 
agttttttag 
ggagggtgga 
tagttagatt 
aggtagtggt 
gtttatgatg 
tttaagtttt 
agggagtgtt 
ggtgatagag 
ttatttagaa 
ttttgtagtg 
ttaagtgaa'g 
gtaaattgga 
agttggtttt 
ttgtttgggg 
gttttgtgag 
tattgtgtgg 
gaagttgggg 
gaagtatggg 
ggtatttgtt 
tattgtgtgg 
tttttagttt 
gggtgttgtg 
agtggaaggg 
ttggtgggga 
gtgaaaatgg 



atagaggtaa 

agtaggattt 

ggggatgtag 

tttttgattt 

tttagagtgg 

ttagaatgag 

atttttatat 

aagttaagta 

ttaaatgagg 

tgtatttttg 

gagagtagag 

gtttgagatt 

ttaggagggt 

tagaaatatg 

tggggtgtga 

agtgtgttta 

ttttttttag 

tttgttttgt 

tttaattggg 

gggtgtgttg 

ggtattgtgt 

gtggggtttg 

tgtaaaggtt 

gtagttaagg 

tagtttttga 

tttatttttt 



gtggtaggta 

gtggttgtag 

gaggtatgga 

ttttttaggt 

ttttgttgtg 

tggtattgtt 

tttttatatg 

ttgggaaaag 

gttgtagttg 

aattttaaga 

tgggtggtaa 

ttttgtttta 

tagggttgga 

ggtatttttg 

ggaggggatg 

gttttgtttt 

tttttttttg 

ggttttgttt 

ttatgttggg 

ggaagggttg 

gtaattttat 

ttatgtatat 

tgtagtgtgt 

ggtagtgtag 

ggattaggtt 

gttgtatttt 



9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11001 



60 
120 
180 
.240- 
300 
360 
420 
480 
. 540 
600 
660 
720 
780 
840 
900 
960. 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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atttgtattt gggttataga tgagtagagg" tggttgttta tatgtaaaaa tatgttgatt 1620 

ttaagttttt tatttttaaa atgttttggt tttttttgag aaagggtttg tgtttattgt 1680 

ttttggagtt tattttttta ggtttgtttt tttttaaata tttatgattt tttttagaat 1740 

ttttagggtg aagggaaatt attatttatg ggagggagtt tggaaaaatt tagaattttt 1800 

ggtgggtttt. ttgtaagtag gagttttgtt gagtttttat ttagtaaata tttttttttg 1860 

atttagtgaa ttagatgtta aaatatgtat gtagttatat atttagtagt ttttttgtat 1920 

ttttgggaat tgttagtaag taaaggttgt ttttttttgg gtagatatta gttggaatta 1980 

ttaggggtgt ttttatagtt ttttttgtta gtttggattt tattgtagat ttgttgaatt 2040 

aattgttggg agtggatttt aggtattagt aaattttaaa aattttttaa attattgtaa 2100 

tatggagttt gggttgagta ttattgtttt ggtttattta ggaatttgtg gatggatagt- 2160 

gttttaggtt tgtgtgtgta tggagatttt tttatttggt ataagaggat attataaatt 2220 

tagttggggg gagtataaag ttgtgataga atgtaaagaa tgaataaggg gttgagtgtg 2280 

gtggtttatg tttgtaattt tagtattttg gaaggtggag gtgggtggat tatttgaggt ' 2340 

taggagttta agattagttt ggttaatatg. gtgaaatttt atgtttatta aaaaataaaa 2400 

aaaaatgagt taggtgtagt ggtgggtgtt tgtaatttta gttatttggg aggttgaggt 24 60 

gggagaattg tttgaatata ggaggtggag gttgtagtga gttgagattg tgttattgtt 2520 

ttttagtttt ggtgata.gag tgagattttg ttttaaaaaa aaaaaaaaaa aaaaaaagaa 2580 

taaggttggg atattgtagt gtttttaaag agaaataaag tagttatgga gataagaagt 2640 

aggatgattt gggtatgttt attagaggta gagataaggg agaaattaaa gataagtttg 2700 

ggtttttgtt tttagtaatt gggagtttag tggttatttt tgttgtaaag aggaagttgg 27 60 

gtaagtgtag tagtgaggtt gaagaaaagg gaattaaatt ttggttatgt ttatttgaaa 2820 

tgttttttag atattttagt gaaggtattg gtatggagga tttagtttga gggtttaggt 2880 

tagtgtttta gttgtggatt tggggtagat gaatgtagat agattaggtt agtgattagg 294 0 

attgagttta gattttattg* tgagatatgg aagttgagtt agaatttgta aaggagttga 3000 

gtaggagttg tagggggtag gaggaaaatt gggagagtgt agtttttggg agttaaaggg 3060 

agtaagtttt aaatgatgtt gagggggtga gaatggagaa tggaatattg gattttattt 3120 

ggtagtatat agattgttga- ggattttgtt ttgggtagtt ttttggagga agaggtaagt 3180 

ttggttggag tgggtagagg ggagagtgaa ggtgaaggat tagagtgtat ' agagattagt 3240 

gttttggttt gaggggagta gagataggtg ataattatag ggtagatgta ggttaaaggt 3300 

gtttagtttt ttttttaagt aaatgggtag atgtatttta tatatgtttt tagtgaaggg 3360 

ttgggtgtgg tggtttaagt ttgtagtttt agtatjtttgg aaggttgagg tgggtggatt 3420 

atttgagatt aggagtttga gattagtttg gttaatatgg tgaaattttg tttttattaa 3480 

aaatataaaa attagttggg tatggtggtg ggtgtttgta attttaggta tttaggaggt 3540 

tgaggtagaa gaattg^ttg aatttaggag gtggaggttg tggtgagttg aaattgtgtt 3600 

attgtatttt agtttgggtg ataaaagtaa gatgtagttt tttgttgttg tttttttaat 3660 

tgttaatgag gaaaggggaa gttttgtgtt aggtgataga gatttaattg ttgagtaggt ' 3720 

ttttttgttt gtggtttttt' ggttggtttt tagatgttta ggtggttaat attagagttt 3780 

gtgtagtagt gtgaggtaat ttattgagat aggttgggtt tgtggagttt ggtgagtagt . 3840 

ggtttttttt ttggggtttt tttttaattt ttgggatatt ' tttttgattt ggagtttttt 3900 

tgttttattg ttaggttttt ttgtagattg taagtttatt tgttattatt gttgttgtgt ' ' 3960 

gtttgtttg't ttggattgtt . gtgggttttg "ggatttg'g'gt tgggaatttg tggbggagtg " 4020 

ggatatgaat gtggtgagtg . tggggttgag ggtgtatggg aaggttgagg atgggtaggt 4080 

tatagtgtag gtatttttga gggttgtttg ggtgttgtgt gtaaggagtg ttttaattgt 4140 

tgattttttg gtggtataga gaggttaatt ttgtgtgggg gttgggaggg gagtttggat 4200 

tgttggtttt gtaagtattt tatttgttgt aagtggattt gggtttaggt tgatttaggt 4260- 

tttgtgtatg tgtatttttt gtattttttt gtttttgttt" ttggttagag gttatttttg 4320 

tgtgtttgtt tggatgttgg tatttgtttt tgttttttgt ggtaggtggg gtttgtgagt 4380 

ggagttttgg agtgatgagg ttatttttgg gggtgaagtg tgtgtgtttt tgttttggtg 4440 

tttttgtttt aatgagataa gagttagatt ttggtgattt atgttttagt tttaatggtt 4500 

gtggtgtggt tttggtttgg gtgtatgtgt atattgatat gtgtatatgt atgtatgtga 4560 

ttggggtggt ggttggtggt tatggatgtg taggattggg ggatgggtgg gtatggttat ■ 4 620 

gggtgaggtg gaggtgtttt tttttgaaat gatttggagt agtatgatga gtagtggtta 4 680 

ttgt'agttaa gaggatttgg atttggagtt tgagtagtat tttattgtgt gaattttgtt 4740 

agtttgtagg ttgtgttggg attaggtggg agttaggggg tgttggtggg tgggagggga 4 800 

agtggttgtt ggagttttgt tttttttggt ttgttgttgt gttttgggtt ggtgggtagt 4860 

tttatttttt tggttatgtg gttttttgtg ggttttggtt ggggatttgt ttgtggaatt 4 920 

gtgtgtaaga ttttgatttt attgtttaga tgttgggtgt tggggttttt ttggtttttg 4980 

ttatagatag gttgaatatg gaaaaagtag ttgtatggtt tgtggtagat .ttgagttggg 5040 

tattatttag ttatgattaa agttgattga gtagtttgga ttagtatttt gatttttgtg 5100 

tttgaatgtt tttgtttttt ttttggggag attaggggag gatgtggaga gggaag'agtt 5160 

tttgttagga attgagaagt atgtttagga aaatttgaga ggtagagaga gattttgttt 5220. 

ttttatttgt atttttgtat ggagttagtt gagtttttat tttttttttg ttttggtttg ' 5280 
ttattagttg ttggaatgtg gaagattttg tttttttttt ttagggtgga tttggagaaa ..-5340 



gatttgggaa tagataggaa agaagttttg ttttggatta taagtattta ggagtatttt 5400 

atttatagga agggggaaag ttagattata aaatgtttaa agaggtggaa aaagagattt' • 54 60 

aggttattaa tttaggattg taaggtgttt tggaattttt taggtatttt tattattgga 5520 

gaattgtgtg ttagatgtta ttggtgtgat tattaggttt agagaattag gtttaggtat 5580 

taggaaaaag aaatagggat tgtgaagttt agtatgtttg gtagaaatgg ggtggaaatt 5640 

tttatttaag taaaga'aagt ggagttgtga gtgatgtttt agataaaatt ttataaaatt 5700 

ttttataaaa tgggtggtgt ttagtatgtt aaaattttag tttagagttt gggtgtaagg 57 60 

gttgagttga gtgtagattt ttgggtttgt ttttatgtta gttagttttg agttattttt 5820 

.tattgtggaa aggtgggaaa attataagat attaattaat tgaaaaggag ggttagttat 5880 

ggaggtgtat atttgtaatt ttagttattt gggagggtga ggtagaagga ttatttgaat 5940 

ttgggaggta gaggttgtag tgagttaaga ttgtgttatt gtattttagt ttgagtgata . 6000 

gagtgagatt ttgttttaaa aatagaaaag gaagttaagt atggtggttt atatttttaa 6060 

tgttaatgtt ttgggaggtt aaggtaggtg gattatttgt aattaggaat ttgaiggttag 6120 

tttggttaat atggtgaaat tttattttta ttaaatatat aaaaattagt tgggtatggt ' 6180 

ggtgtgtgat tgtagtttta gttatttggg agattgaatt attttaattg ggaggtaaag 6240 

gttgtagtga gttaagattg tgttattgta ttttaatttg ggtgataggg tgaggttttg 6300 

ttttaaaaaa aagaaagaag gttgggtttg gtgatttatg tttgtaattt ' tagtattttg ■ 6360 

ggaggttaag gtaggtagat tatttgaggt taagagtttg agatttgtta ggttaatata 6420 

gtaaaatttt gtttgtattg aaaatataaa aaaattattt ggttatggtg gtgtgtgttt 6480 

gtaattttag ttattgggga ggttgaggta ggagtattat ttgaatttag aagatagagg 654 0 

ttgtagtgag ttgagattgg gttattgtat tttagtttgg atgagagagt aagattttgt 6600 

tttaaaaaaa aaaaaaaaaa aaaagaaaga ataggaggtt gagaagtttt aagttatatg 6660 

ttaaaaaaaa agaaaaaaat attagtttta ggttaggtgt agtggtttat atttttaatt - ■ 6720 

ttagtatttt ggaaagttga ggtgggtgga ttatgaggtt aggagtttaa" gattagtttg 6780 

gttaaaatgg tgaaattttg ttttgattaa aaatataaaa aattagttag ttgtggtggt 68 4 0 

aggtatttgt aattttagtt atttgggagg ttgaagtaga gaattgtttg aatttaggag 6900 

gtagagattg taatgagtta agattgtatt attgtatttt agtttggaaa atagagtgag 6960 

attttgtttt aaaaaaaaaa ttattagttt ttatggatag tggtagagtg gagggtgggt. 7020 

ttttatggtg tagaagggaa attttatggt tttgttgtgt atttgattgg gatggttgtt .7080 

gaaatttttt tttagtaggt agttttggaa atagaaaaag aaattttttt ttttttagaa 7140 

ttttggaagg gttgtgtagt- gtttttaatt taagtttgtt ttttgagtga agatagggag 7200 

gtttattatt agaagggaag gggttggaaa tgaggttatt gtattttagt ttagggtttt 7260 

tgggttattt aggaagggaa gaaggagtaa gtttttttat tgttaggtag gagtttagag 7320 

ttattataag aataagttag tattattttt gtgttttttt tgttttgtaa ataaaatgat 7380 

tttttttttt gttttggtat tagagtttgt ttggtatttt ttttgttttt agtatttttt • 7440 

ttatttgggt attttttttt gttggtgtat tgaataaata tatttattgt tttatttata 7500 

gtttttagtt tttatttttt agggtttata ttatttgttt ttattaattt gataaggttg 7560 

tttattgttt ttagtaag'gt ttgtattggg gtttttattt tagtgttttt ttttatttag 7620" 

gagatttttg gatatttggg gaagaaaatg agtttaaatt tttatttttt tttttttatt ' 7680 

ttttttttgt aaggttttgg ttttagtttt tagttttata tttttgttgg ttgtagaata 7740 

gtagtgggtt ttgggtaagg agtattttgt taaaatgttt tattttgttt ttttatttgt ■ • 7800 

tttttttatt tgtttttatt agatggttta agtgtttaag gggattttag ggtggagtta 7860 

gggagaattt tggttttttt gggttaggta taagattatt ttataggaaa ttttgtggga 7920 

atttttttgg gataaagtat tggttagtgt tgagtttagt tgtgt.ttgtg atatttgtat .7980 

tttaattagg gtttatttga tgttaatagg aagtaaggtt gatgtagtgg ggttaaggga 8040 

gtttgggaga agaaagttgg tttagagttt tggttgtttt gttttatatt ttattttttt. 8100 

ggtaagaatt tagtttttag atgaggtggg gagtgagtgg ttgagttaaa aatttttggg . 8160 

ttgggtatga tggtttatgt ttgtaatttt agtattttgt gaggtgaagg taggtggatt 8220. 

atttgaggtt aggagtttaa gattaatttg gttaatgtgg tgaaatttta tttttattaa 8280 

aaatataaaa attagttggg tgttgttgtg gtatgtgttt gtagttttag ttatttggga 8340 

gtttgaggta ggagaattgt ttgaatttag gaggtagaat ttgtagtgag ttaagattta' 8400 

gttattgtat tatagtttgg gtgatagagt gaggttttgt tttaaaaaaa aaaaaaattt 8460 

ttgggttaaa tttttagata gtataggtag gtgtagaaat ttattaggaa gttgtttgtg - 8520 

tatttttggt agattggagt ttggtttaaa gttgtttttt atgtagtttg ggttaaggtt ■ 8580. 

aaatattatg ttatagtgat tttttttatt atgtgtgaga tatggagaat tggttttaag 8 640 

tattattttg tttattggtg gttggattat tgatgtgtat tattttttat tttttttatt 8700 

ttgtagtggg ttatggtttt gtgttggggt agaggagaaa aatgggttgt tttttttagg 87 60 

ataaattttt attttaattt aattagggtg ttgtgattag aatgtgtaat t'gaggtgtga . 8820 

ttttattgat tttttttttt tttgagattg agttttgttt ttgttgttta ggttggagtg 8880 

tgatggtatg attttagttt attgtaattt ttattttttg agtttgagta atttttttgt 894 0 

tttagttttt taagtagttg ggattatagg tatgtgttat tatgtttggt taattttgta 9000 

tttttagtag agatggggtt tttttatgtt ggttaggttg gttttaaatt tttgatttta \ 9060 

ggtgatttat ttgttttggt tttttaaagt gttagaatta taggtgtgag ttaatgtgtt '/ 9120 
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tagtttgttt ttgttttttg tgttttgaag tagggtttta tttagttttt taggttggag 9180 

tgtagtgata tgataatagt ttattgtagt tgtaattttt tgggtttaaa tgattttttt 9240 

attttagttt tttgaatagt tgggattata ggtatattat tatatttggt taattttttt. 9300 

tttttttttt ttagtaga'ga tgaggttttg ttatgttgtt taagttggtt ttaaattttt ' 9360 

gaggattaag tgattttttt attttagttt tttaaaatgt tgggattgta gatgtgagtt 9420 

attatattta gtttgatttt attttaaatg agagtttttt tttagagttt tttagttgtt 9480 

tttggttttt ggttatgtgt ttttagttgt ttttgttttt gtggtatttt taaggttata 9540 

tttagtgttg aggttttagg taggtagtag agagaagtta aatgattttg tttttttttt 9600 

atttatttag agtatgtaaa attaggagta gtggtgggtt tagggtgggt attagttatg 9660 

ta'tatgtata ttagggatag ggggttaaag gtagttagtt tttaaagatt gttttagagg 9720 

ttatttttta gagaagtttt gggtttttta agggttttgt gtttatgttg gtttattttg 9780 

taggatgagt ttgtggagtg ggagatattt gatttttttt aagttgagat tgagtagaag 9840 

attaaggagt ataatgftta gattaatagt aattttttta tgagtttggt gagttgattg 9900 

tttaggaagg gggtgtgggg aggagtaggt atttagttat gtgtttgata tttagagggt 9960 

tataattgag gttattttgg gtgggtgtaa gtagtaattt gtgtatattt agtttagttt 10020 

taagtagatt gatattttat ttggaattta ttattaaggt ttggtttttt tattttttta 10080 

gaataaggat ggtttttata taggttttat taaggtttag ttgaagttgg tgtgttttgt 10140 

ttttgtgttt tttagtaaga agttattttt tttgtaggat gtttggtggg gtttaggatg 10200 

gggtataagt gttaggtgtt gtattttttt ttatttgttt aaggatgttg ttaagtattt 102 60 

gtatgtgttg ttatgtataa gggtatgtga agttattgag gttttgttgt gaaagttttt 10320 

ggtggtggat gatttttgta agtttgtatt ttttgagtgt gttgagtgtt atggttaagg 10380. 

tgggtttttt attttatttt gttttatgtg agggtatata tgtatgtatt tgagtatgta 10440. 

ggggttgagt agttggtttt gtttttgatt attatttttt ttttatagtg tatttgtgga. 10500 

agttgttgga tgatgagtag tttttgtggt tgtggttttt ggtagggttt agtgataagg 105 60 

ttttgagttt tgttttgaag gaaaatgatt ttggggaggt gaatgtgagt atatagtttt 10620 

tagttttttg gttgttatta gataggattg atgggttgta gttatagtaa ggtttggagg 10680 

aggaattgtg ttggaagata agttttgtaa aatagtttta ggagtgtata ggtattgtaa 10740 

ttaaagtaaa ggtttttaga ttatttatgt taaagtttag ggttgtttta agaagttagg 10800 

aagaattgtt ttggtgtttt gatttttttt ggtgtggaaa attttttgga gatgtaggag 10860 

tttatttaat gatatgagga ggtttttttt agatttttta tttggaagtt ttttggtttt . 10920 

aaggtattag gtttgtggag tgaaattaga tttagaatat gtttgatttg tttataggta 10980 

attggggaat atttgatttg g 11001 



